Gene TM1040_3321 details


Gene Information

Locus tag: TM1040_3321
Symbol:
ID: 4075726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 329654
End bp: 331141
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 62%
IMG OID: 638004829
Product: aldehyde dehydrogenase (acceptor)
Protein accession: YP_611555
Protein GI: 99078297
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
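
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a short sketch can check. The variable names are illustrative and not part of the record; the calculation assumes inclusive coordinates and a single untranslated stop codon at the end of the CDS.

```python
# Values copied from the gene record; variable names are illustrative.
start_bp = 329654
end_bp = 331141

# Assumption: coordinates are inclusive, so both end positions count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed 1488 bp and 495 aa.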


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.566743
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.20798
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGATT GGAAAGCGGC GGCTTCGGCT CTGTATGATG GCGGGTTTCG CCCGATGTTC 
ATCAACGGAA AATGGTGTGA ATCGCAATCC GGCGAGGTGA TCGAGGCCCG TAATCCTGCG
AGCGGCGCTT TGCTGGCAAC GGTGCCCAAA GGCGGCGCAG CGGATGTTGA TGCGGCTGTC
GCGGCGGCGC GCGCGGCTTT TGAGGGGCCG TGGTCGAAAT GGACCCCCTT TGAGCGTCAG
GCCCTGCTGC TCCGGATCGC GGATCGCTTT GAGGCAGAGT GGGAAACGCT TTGTCTGTCC
GACACGCTTG ATATGGGGAT GCCGATTCAG CGCACGCTCG CCAACAGCCG TCGTGTTCTG
GGGATGCTGC GCTTTTATGC GGGGCAGGCA GTCACCATCC ATGGCCACAC GATCCCCAAC
TCCTTTCCGG GGGAGATCCA TTCCTCGACC GTGCGCGAAC CGGTGGGCGT GGTGGGCGCG
ATCATTCCGT GGAACGCGCC GATCGCGGGT TCGATCTGGA AGATCGCGCC AGCCATCGCA
ACCGGCTGCA CGGTGGTGCT GAAACCTTCC GAGGAGGCCT CTTTGACGGT GCTGATGATT
GCCCGGATCA TGCAGGAAGC GGGCCTGCCC GATGGGGTTT TGAATATCGT CACCGGGTAC
GGCGCCGCGG CGGGTGCGGC GCTGGCGGCG CATTCTGGTG TCGACAAGAT CGTCTTTACC
GGCTCCACCG CGACAGGGCA GGCCATCGCC CGCGCGGCAA CCGGAAACCT CAAACGGGTT
TCGCTGGAGC TTGGCGGCAA ATCCCCGGTG ATTGTCTGCC GGGATGCAGA CATTGAAAAA
GCGGTGCCCG TCGCAGCCAT GGCGGTGTTT GCAAACTCGG GCCAGATCTG CATCGCCGGG
TCGCGGCTGT TTGTGGCGCG CGAGATCCAC GACGAATTTG TGCGACGTGT TGCGGAATTC
GCTGCCAATC TGCGTATTGG TCACGGCATC GAAGAGAGCA CGGACGTAGG CCCGATCATC
TCTGCGCGGC AGGCAGAGCG TATTGCGGGC TATCTCGCTG CCGGCCCAAG CGAAGGTGCG
GAGATCCTGA CCGGTGGCGC ACGGGTGAAA GGCGCGGGTT TTGAAGGCGG ACACTTCATC
GAGCCAACTG TGTTTGGTGG CGTCACGGAC GAGATGTCCA TCGCACGCGA GGAGATCTTT
GGTCCGGTGA TCTCGGCGCT GCCGTTTGAC AGTCTCGATG AGGTGGTCGA GCGGGCCAAC
GCGACACCTT ATGGGTTGGC TGCTGGTGTG TTCTCGACCC ACCTCGGGAC CGCGCACAAA
TTGGCACATC GCCTGAAGGC GGGATCAGTC TGGGTCAATA TGTACCACGC GATCGACCCT
GCGGTGCCCT TCGGAGGCGT CAAGATGTCA GGCTACGGGC GCGAAGGCGG CACCGAGCAC
ATGGAAGAAT ACCTCGATAC CAAGGCGATC TGGATCAACA CGGACTGA
 
Protein sequence
MTDWKAAASA LYDGGFRPMF INGKWCESQS GEVIEARNPA SGALLATVPK GGAADVDAAV 
AAARAAFEGP WSKWTPFERQ ALLLRIADRF EAEWETLCLS DTLDMGMPIQ RTLANSRRVL
GMLRFYAGQA VTIHGHTIPN SFPGEIHSST VREPVGVVGA IIPWNAPIAG SIWKIAPAIA
TGCTVVLKPS EEASLTVLMI ARIMQEAGLP DGVLNIVTGY GAAAGAALAA HSGVDKIVFT
GSTATGQAIA RAATGNLKRV SLELGGKSPV IVCRDADIEK AVPVAAMAVF ANSGQICIAG
SRLFVAREIH DEFVRRVAEF AANLRIGHGI EESTDVGPII SARQAERIAG YLAAGPSEGA
EILTGGARVK GAGFEGGHFI EPTVFGGVTD EMSIAREEIF GPVISALPFD SLDEVVERAN
ATPYGLAAGV FSTHLGTAHK LAHRLKAGSV WVNMYHAIDP AVPFGGVKMS GYGREGGTEH
MEEYLDTKAI WINTD
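
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation follows, assuming the codon assignments shared by NCBI translation tables 1 and 11. The function name `translate` and the variable names are hypothetical. The sketch does not handle the alternative start codons that table 11 allows, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the amino-acid string
# that NCBI lists for translation tables 1 and 11 ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and
    stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACGGATT GGAAAGCGGC GGCTTCGGCT CTGTATGATG GCGGGTTTCG CCCGATGTTC"
last_line = "ATGGAAGAAT ACCTCGATAC CAAGGCGATC TGGATCAACA CGGACTGA"

print(translate(first_line))  # MTDWKAAASALYDGGFRPMF
print(translate(last_line))   # MEEYLDTKAIWINTD
```

The two outputs match the first 20 and the last 15 residues of the protein sequence. The last line can be translated on its own only because the preceding 1440 bases are a multiple of three, so it starts on a codon boundary.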