Gene Sare_1077 details


Gene Information

Locus tag: Sare_1077
Symbol:
ID: 5704345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1208621
End bp: 1209634
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 70%
IMG OID: 641270592
Product: short chain dehydrogenase
Protein accession: YP_001535976
Protein GI: 159036723
COG category: [R] General function prediction only
COG ID: [COG0300] Short-chain dehydrogenases of various substrate specificities
TIGRFAM ID:
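
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1208621, 1209634)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1014 bp and 337 aa, matching the Gene Length and Protein Length fields.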


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00107266
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGGCGCG GGCTACGGGA GATTCAGGTA GCGGTGGTCA CGGGTGCCAG CGGCGGCGTC 
GGGCGGGCCA CCGTACGGCA GCTGGCGCGG CCGGGGATCG CGATCGCCCT GTTGGCCCGC
GGTCGCACCG GCCTCGACGC CGCGGCCGAG GACGTCCGCT CCGCCGGTGG CCACGCCATG
CCGATCGAGG TGGACATGGC CGACTTCGAC CAGGTCGCCG CCGCCGGTCA GCGCGTCGAG
GACGAACTGG GGCCGATCGA CCTGTGGATC AACGTGGCCT TCAGTTCGAT CTTCGCACCC
TTCATGCAGA TTCGGCCCGA GGAGTTCCGC CGCACCGCTG AGGTCTCATA CCTCGGTTAC
GTCTACGGGA CACGGGTGGC GTTGGATCAC ATGACCCGGC GCGATCGGGG CACCATCGTG
CAGGTCGGGT CGGCCTTGGC CTACCGGGGA ATTCCCCTGC AGTCCGCCTA CTGCGGGGCC
AAGCACGCCA TCGTGGGGTT CACCGAGTCA CTGCGCTGTG AGTTGCTGCA CGACAAGAGC
AACGTCAAGG TCACCATGGT GCACCTGCCC GCGATGAACA CACCACAGTT CTCGTGGCTG
CTGTCCCGGC TGCCACGGCA CGCCCAGCCG GTCCCGCCCA TCTACGAGCC GGAGGTCGCC
GCCCGTTCCA TCGTCGCCGC CGCGGCCCGG CCCGGCCGGC GAGCGTACTG GGTGGGTACC
CCCACGGCGC TGACCATCGT GGGTAACCGT CTGGTTCCGG GTCTGCTGGA TCGCTACCTG
GGCCGGACCG GCTACCGCTC GCAGCAGACC GACCAGCCCG TCGACCCGGA CCAGCCGGCG
AACCTGTGGC AGCCGGTCGA CGGACCGGGC GGCCACGACC ACGGCGCGCA CGGCGCGTTC
ACCGACCGGT CACTGCGGCA CAGCCCGCAG GCGTGGCTGT CCCGGCACCG GATGGTCTCG
GTAGCGGGAG TGGCCGGGCT GTTGTTCGGC GTTCTCGCCT GGCGTCGACA CTGA
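
The gene sequence is printed in space-separated blocks of ten bases. A minimal sketch of how the GC content field could be recomputed from such text follows; it assumes GC content is simply the fraction of G and C bases over the whole sequence, and the function name is hypothetical.

```python
def gc_percent(seq_text: str) -> float:
    """Percentage of G/C bases in a sequence that may contain spaces or newlines."""
    bases = "".join(seq_text.split()).upper()  # strip block and line separators
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Small illustrative input, not the full gene: 6 of 8 bases are G or C.
print(gc_percent("ATGC GGCC"))
```

Passing the full gene sequence text to gc_percent would give the value to compare against the rounded GC content reported in the record.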
 
Protein sequence
MGRGLREIQV AVVTGASGGV GRATVRQLAR PGIAIALLAR GRTGLDAAAE DVRSAGGHAM 
PIEVDMADFD QVAAAGQRVE DELGPIDLWI NVAFSSIFAP FMQIRPEEFR RTAEVSYLGY
VYGTRVALDH MTRRDRGTIV QVGSALAYRG IPLQSAYCGA KHAIVGFTES LRCELLHDKS
NVKVTMVHLP AMNTPQFSWL LSRLPRHAQP VPPIYEPEVA ARSIVAAAAR PGRRAYWVGT
PTALTIVGNR LVPGLLDRYL GRTGYRSQQT DQPVDPDQPA NLWQPVDGPG GHDHGAHGAF
TDRSLRHSPQ AWLSRHRMVS VAGVAGLLFG VLAWRRH
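
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library; translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in which codons may act as starts. The function name and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Amino acids for codons in TCAG order (shared by NCBI tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a DNA sequence (spaces/newlines allowed) up to the first stop."""
    dna = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First thirty bases of the gene sequence above.
print(translate("ATGGGGCGCG GGCTACGGGA GATTCAGGTA"))
```

For the first thirty bases this yields MGRGLREIQV, the first ten residues of the protein sequence; the final four codons (CGT CGA CAC TGA) give RRH followed by the stop.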