Gene Gdia_2400 details

Gene Information

Locus tag: Gdia_2400
Symbol:
ID: 6975830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2661629
End bp: 2663020
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 66%
IMG OID: 643391923
Product: Aldehyde Dehydrogenase
Protein accession: YP_002276765
Protein GI: 209544536
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTACG CGACGATTAA TCCGTTTACG AACGAACAGA TCAAGACATT CCCGAATTCG 
ACCGATTCCG AGGTCGATGC CGCGCTGGAC GCGGCGCACG CGGCGTTCCT GTCCTGGCGC
GAAACCTCGT TCGCCGAACG GGCGAAGATC ATGCAGAAGG CCGCCGACAT CCTGCGCCGC
GACAGCGAGA AATATTCCGC CCTGCTGACG CTGGAAATGG GCAAGCTGAT TTCCGAAGCG
CGCGCGGAAA CCGAACTGTC GGCCGCGATC TTCGAATATT ACGCTAAGAA TGCGGAAAAG
CTGCTGGCGC CGGAGAGCCT GCCGGTCGCC AGCAAGGACG AAGGCCGCGC CAGGATCATC
TTCCAGCCGC TGGGCATCCT GCTGGCCGTC GAGCCGTGGA ACTTCCCGTT CTACCAGGTG
GCCCGCATCA TCGCGCCGCA GCTTTCCGCC GGCAACACGG TCGTGCTGAA GCACGCGTCC
AACGTGCCGC AATGCGCCGA GGCCATGGAC CGGCTGATGA TCGAGGCCGG CCTGCCGAAG
GGCGGGTTCC GCAACCTCTA TCCCACGCAT GACCAGGTCT CGCGCATCAT CGCCGACCAC
CGCGTGCGCG GCGTGGCGCT GACCGGTTCG GAAGGGGCGG GGGCCAAGGT CGCATCCGGT
GCCGGGGCGG CGCTGAAGAA ATCGACCATG GAACTGGGCG GGTCGGACGC CTTCATCGTG
CTGGAGGACG CGGACCTCGA AAAGACGATC AAGTGGGCCG TGTTCGGCCG CCACTGGAAC
GCCGGGCAGG TCTGCGTCTC GGCCAAGCGG ATGATCGTCG TGGACAAGGT GTACGACGCG
TTCGCCAAGG GCTATCGCGA AGGCGTGGCG AAGCTGAAGA TGGGCGACCC GATGGACCCG
TCCACCACGC TGGCGCCGCT GTCGTCGCAG GGTGCGGTCA ACGACCTGAA GAAGCAGGTC
GAAGGGGCGG TCGCCGCCGG GGCGAAGGCC GAGGAGATTC CGCTGCCGTT GCCCAATGCG
GGCGCCTTCT TCCGCCCGGT GATCCTGTCG GACGTCGCGC ATGACAATCC GGCCCGGCGC
GAGGAATTCT TCGGCCCCGT CACCCTGCTG TTCCGGGCGA AGGACGAGGC GGATGCGATC
CGCATCGCCA ACGACTCGCC CTATGGCCTG GGCGGCTCGG TCTTCACCCG GGATGAAGCG
CGCGGCGAGG CCGTGGCCGC GCAGATCGAG ACCGGCATGG TCTATGTCAA CCATCCCACG
ATGGTGAAGG CCGACCTGCC GTTCGGCGGC GTGCTGCGCT CCGGCTACGG GCGCGAGCTG
ATCGGGCTGG GCATCAAGGA ATTCGTCAAT GCCAAGCTGA TCGACGTCGT GGACATCGAT
GCGCCGTTCT GA
 
Protein sequence
MAYATINPFT NEQIKTFPNS TDSEVDAALD AAHAAFLSWR ETSFAERAKI MQKAADILRR 
DSEKYSALLT LEMGKLISEA RAETELSAAI FEYYAKNAEK LLAPESLPVA SKDEGRARII
FQPLGILLAV EPWNFPFYQV ARIIAPQLSA GNTVVLKHAS NVPQCAEAMD RLMIEAGLPK
GGFRNLYPTH DQVSRIIADH RVRGVALTGS EGAGAKVASG AGAALKKSTM ELGGSDAFIV
LEDADLEKTI KWAVFGRHWN AGQVCVSAKR MIVVDKVYDA FAKGYREGVA KLKMGDPMDP
STTLAPLSSQ GAVNDLKKQV EGAVAAGAKA EEIPLPLPNA GAFFRPVILS DVAHDNPARR
EEFFGPVTLL FRAKDEADAI RIANDSPYGL GGSVFTRDEA RGEAVAAQIE TGMVYVNHPT
MVKADLPFGG VLRSGYGREL IGLGIKEFVN AKLIDVVDID APF
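
The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The Python sketch below is illustrative only and is not part of the original record: the helper names (span_length, protein_length, clean_sequence, gc_percent) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon, which matches the TGA at the end of the gene sequence above.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks that group residues into blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


gene_length = span_length(2661629, 2663020)
print(gene_length)                  # 1392, matching "Gene Length"
print(protein_length(gene_length))  # 463, matching "Protein Length"
```

To check the GC content field, paste the gene sequence block into gc_percent as a single string; the grouping spaces and line breaks are removed before counting.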