Gene TM1040_2781 details


Gene Information

Locus tag: TM1040_2781
Symbol:
ID: 4076549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2939877
End bp: 2941073
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 62%
IMG OID: 638008106
Product: iron-containing alcohol dehydrogenase
Protein accession: YP_614775
Protein GI: 99082621
COG category: [C] Energy production and conversion
COG ID: [COG1454] Alcohol dehydrogenase, class IV
TIGRFAM ID:
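
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, although the listed values are consistent with them. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2939877
END_BP = 2941073
GENE_LENGTH_BP = 1197
PROTEIN_LENGTH_AA = 398


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```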


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.409479
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGCG GGGGCAGCGC CCCCACCCCG GCTCACCATA GGAACATGCT GATGTCTCTC 
ATTGGAAACT GGTCTTATCC GACCGCAATC AAATTCGGCG CAGGCCGGAT CAAGGAACTG
GCCGATGCTT GCGCGCAAGC CGGGATCAAA AAGCCGCTCT TGGTCACCGA CAAGGGGCTT
GCAGATCTGC CCGTCACTCA ATCGACGCTC GATATCATGG AGGCCGCAGG CCTTGGGCGC
GGGATGTTTT CTGAGGTCGA CCCCAACCCG AACGAGAAAA ACCTCGACGC GGGTGTTGCG
GCCTACAAGG CAGGCGGCCA TGACGGTGTG ATCGCCTTTG GCGGCGGCTC CGGCCTCGAT
CTGGGCAAAA TGGTTGCGTT CATGGCGGGC CAGACCCGCC CGGTTTGGGA TTTTGAGGAC
ATCGGCGACT GGTGGACCCG CGCGGACGCG GATGCGATCG CCCCGATCAT TGCCGTGCCG
ACCACCGCGG GCACCGGATC TGAGGTCGGG CGTGCCTCTG TCATCACCGA TAGCGCCACC
CACCAGAAAA AGATCATCTT CCACCCCAAG GTTCTGCCCA CCGTGGTGAT TTGCGATCCG
GAGCTTACCG TCGGGATGCC CAAATTCATC ACTGCCGGCA CCGGGCTTGA TGCCTTTGCC
CATTGCGTCG AGGCGTTTTC CTCGCCGCAC TACCACCCGA TGTCACAGGG TATGGCGCTC
GAGGGTATGC GCCTGGTCAA GGACTACCTT CCGCGCGCTT ATGCGGACGG CACCGACATT
GAGGCGCGCG CGCACATGAT GTCTGCGGCT GCCATGGGCG CCACCGCGTT CCAAAAAGGT
CTTGGCGCGA TTCACGCCAT GAGCCACCCG ATTGGCGCGC ATTTCAACAC GCACCACGGC
ACCACCAACG CGGTCTGCAT GCCTGCAGTG CTGGAATTCA ACGCGTCCGA GATTTCCGAA
CGCTTTGACA TGGCAGCGGC CTACCTCGGG ATCGAGGGCG GCTTTGAGGG CTTCAAGGCC
TTCGTGCAAG AGTTCAACGA CAGCCTCGGC ATCCCGCGCG GCCTGTCTGC GTTGGGCGTG
ACCGAAGAGT CGATCCCGGA GCTGGTCAAA GGCGCGATCA TTGATCCCAG CTGCGGCGGC
AATCCCGTCA AGCTGACTGA GGAAAACCTC ACCCAGCTGT TCAAAGCCGC GCTTTGA
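
The gene sequence is displayed in space-separated blocks of ten bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage such as the "GC content" field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line is used as input; how the database itself computes or rounds the reported figure is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence (six blocks of ten bases).
first_line = "ATGGCAAGCG GGGGCAGCGC CCCCACCCCG GCTCACCATA GGAACATGCT GATGTCTCTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` in the same way gives the input for a whole-gene figure.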
 
Protein sequence
MASGGSAPTP AHHRNMLMSL IGNWSYPTAI KFGAGRIKEL ADACAQAGIK KPLLVTDKGL 
ADLPVTQSTL DIMEAAGLGR GMFSEVDPNP NEKNLDAGVA AYKAGGHDGV IAFGGGSGLD
LGKMVAFMAG QTRPVWDFED IGDWWTRADA DAIAPIIAVP TTAGTGSEVG RASVITDSAT
HQKKIIFHPK VLPTVVICDP ELTVGMPKFI TAGTGLDAFA HCVEAFSSPH YHPMSQGMAL
EGMRLVKDYL PRAYADGTDI EARAHMMSAA AMGATAFQKG LGAIHAMSHP IGAHFNTHHG
TTNAVCMPAV LEFNASEISE RFDMAAAYLG IEGGFEGFKA FVQEFNDSLG IPRGLSALGV
TEESIPELVK GAIIDPSCGG NPVKLTEENL TQLFKAAL
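
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons; because this gene starts with ATG, no special start-codon handling is attempted. The `translate` function and the "X" placeholder for unrecognized codons are illustrative choices, not part of the record.

```python
from itertools import product

# Codon table laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X: unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases of the gene sequence give the first six residues.
print(translate("ATGGCAAGCG GGGGCAGC"))  # MASGGS
```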