Gene Gdia_2370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2370 
Symbol 
ID6975800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2626018 
End bp2627520 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content64% 
IMG OID643391895 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002276737 
Protein GI209544508 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCATCG ATCAGGAACC TCCCTTCAAA TCCCGTTACG GAAACTATAT CGGCGGCGAA 
TGGACGTCCC CCGTGAAGGG CGCGTACATG AAGAACACGT CCCCGGTGGA CGGGCGGGTC
CTGTGCGAAG TGCCGCAATC GACGGCCGAG GATGTCGAAC TGGCGCTGGA CGCGGCGCAC
AAGGCCTTCA AATCCTGGGC CCATACCTCG CCGGCCGAGC GGTCGCGGGT GCTGCTGAAG
GCGGCGGACC GGATGGAGGC GAATCTCGAC CTGCTGGCCC GGGCCGAGAC CTGGGACAAC
GGCAAGCCGA TCCGCGAGAC CCTGGCGGCG GATATTCCGC TGGCCATCGA CCATTTTCGC
TATTTCGCCG GGTGCATCCG CGCCCAGGAA GGCGCGCTGA GCGAAATCAA CGACACGACC
ATCGCCTATC ATTTCCATGA ACCGCTGGGC GTGGTCGGCC AGATCATTCC GTGGAACTTC
CCGATCCTGA TGGCGTCGTG GAAGCTGGCC CCCGCCCTGG TCGCCGGCAA CTGCGTCGTC
ATGAAGCCCG CCGAAACCAC CCCCGCCAGC ATCCTGGTGC TGATGGAACT GATCGGCGAC
CTGTTCCCGG CGGGCACCCT GAACATCGTC AACGGCCTGG GCCGCGATGT CGGCGCGGCG
CTGTCCACCA GTTCGCGCAT CGCCAAGATC GCCTTCACCG GATCGACCCC GACCGGCAAG
ATGATCGCGC ACGCGGCGGC GGAAAACCTG ATCCCGGCCA CGCTGGAACT GGGCGGCAAG
TCGCCCAATA TCTTCTTTGC CGACGTTATG GACCAGGACG ACGATTACCT GGACAAGGCG
ATCGAGGGCT TCACGATGTT CGCCCTGAAC CAGGGCCAGA TCTGTTCCTG CCCCAGCCGC
GCGCTGGTTC ATGAATCGAT CTACGAGCGG TTCATGGAAC GCGCCCTGCC CCGCGTGAAG
GCCATCCGCC ACGGCAACCC GCTGGACCCG CAGACCATGA TGGGGGCCCA GAACTCCAGC
ATGCAGGAAA ACAAGATCCT GGAATATATC GGCATCGGCA AGGACGAGGG CGCGGAACTG
CTGACCGGCG GCGCCAAGCC CGACCTGGGG GCCGCGTTCA ACGACGGCTT CTACGTCCAG
CCGACGGTGT TCCGCGGCCA TAACAAGATG CGGATTTTCC AGGAGGAAAT CTTCGGGCCG
GTGCTGGCCG TGACCACCTT CAAGACCGAG GAAGAGGCGC TGGCGATCGC CAACGACACG
CCGTTCGGCC TGGGGTCCGG CGTGTGGTCG CGCAACGCCA ACATCTGCTA TCGCATGGGG
CGCGGCCTGG AAGCGGGGCG GGTCTGGATC AATTGCTACC ATGCCTATCC CGCGCACGCG
GCGTTCGGCG GATACAAGAA GTCCGGCATC GGCCGCGAAA CGCACAAGAT GGTGCTGGAA
CACTACCAGC AGACGAAGAA CATGCTGGTC AGCTACAGCG AGAAAAAGCT CGGATTCTTC
TGA
 
Protein sequence
MGIDQEPPFK SRYGNYIGGE WTSPVKGAYM KNTSPVDGRV LCEVPQSTAE DVELALDAAH 
KAFKSWAHTS PAERSRVLLK AADRMEANLD LLARAETWDN GKPIRETLAA DIPLAIDHFR
YFAGCIRAQE GALSEINDTT IAYHFHEPLG VVGQIIPWNF PILMASWKLA PALVAGNCVV
MKPAETTPAS ILVLMELIGD LFPAGTLNIV NGLGRDVGAA LSTSSRIAKI AFTGSTPTGK
MIAHAAAENL IPATLELGGK SPNIFFADVM DQDDDYLDKA IEGFTMFALN QGQICSCPSR
ALVHESIYER FMERALPRVK AIRHGNPLDP QTMMGAQNSS MQENKILEYI GIGKDEGAEL
LTGGAKPDLG AAFNDGFYVQ PTVFRGHNKM RIFQEEIFGP VLAVTTFKTE EEALAIANDT
PFGLGSGVWS RNANICYRMG RGLEAGRVWI NCYHAYPAHA AFGGYKKSGI GRETHKMVLE
HYQQTKNMLV SYSEKKLGFF