Gene Gdia_3331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3331 
Symbol 
ID6976774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3642566 
End bp3643816 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content64% 
IMG OID643392845 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_002277673 
Protein GI209545444 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.150101 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.101451 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA TTGTCCTTCA TGAGGATATT CCGGAATCCC TGCTGCCCGG TGGCGAGGCC 
GCGGCGGCCG CGACCCACAC GGTGGAAATC GATTCCCACG CCCTGAATTT CGGCCCGCAG
CATCCGTCCG CCCATGGTGT GCTGCGCCTG GTCCTGGAAA TGGAGGGCGA GGTCGTCGCC
CGCGCCATTC CGCATATCGG CCTGCTGCAT CGCGGCACCG AAAAGCTGAT CGAATACAAG
ACCTATCCCA AGGCCCTGCC GTATTTCGAC CGGCTCGATT ACGTCTCGCC GATGTGCGAG
GAGCAGGCTT TCGCGCTGGC GACCGAAAAG CTGCTGGGGA TCGACATTCC CGATCGCGCG
AAATGGATTC GCGTGATGTT CGCGGAAATC ACCCGGATCC TGAACCATAT CCTGAACCTG
ACGGCGCTCG GGCTCGATTG CGGCGCGGTG ACCCCGGCGC TGTGGGGCTA CGAGGAACGC
GAAAAGCTGA TCGAGTTCTA CGAGGCCGCG TCGGGCGCCC GGTTTCATGC CAATTACTTC
CGTCCCGGCG GGGTCTCGCG TGACCTTCCG GCGGGGCTGG AGGATCGGAT CGCCGAATGG
GCGCGCCAGT TCCCGGCCTG GATCGACGAT CTGGAATCGC TTCTGACCAA CAACCGGATC
TGGAAGCAGC GCACGGTCGG GATCGGCATC TTCACGACCG AGCAGGCGCT GGCCTGGGGC
TTCAGCGGTC CGTGCCTGCG CGCCTCGGGC GTGCCGTGGG ACCTGCGCCG CGCCCAGCCC
TATGACAATT ACGACAAGGT CGAGTTCAAC ATCCCCGTCG CGCGCCAGGG CGATTGCTAC
GACCGCTACC TGATCCGCGT CGCGGAAATG CGCGAGAGCG TGCGGATCGT CGAACAGTGC
CTGGCCCAGA TGAAGCCCGG CCCGATCAAG ATCCAGGACC ACAAGATCAC GCCGCCGCCC
CGGCGCGAGA TGAAGCGGTC GATGGAAGCC CTGATCCATC ATTTCAAGCT GTTCACGGAA
GGGTACCACG TCCCGCCGGG GGCAACCTAT ACGGCGGTCG AAAGCCCCAA GGGCGAATTC
GGGGTCTATC TGGTCGCGGA TGGCAGCAAC CGGCCCTACC GGTGCAAGAT CCGGCCGACC
GGCTTCGCCC ATCTGCAGGC CATCGACGAG ATGTCGCGCC GCCACATGCT GGCCGACGCG
GTGGCGATCA TCGGGTCGCT GGACCTGGTG TTCGGCGAGA TTGACAGGTG A
 
Protein sequence
MSDIVLHEDI PESLLPGGEA AAAATHTVEI DSHALNFGPQ HPSAHGVLRL VLEMEGEVVA 
RAIPHIGLLH RGTEKLIEYK TYPKALPYFD RLDYVSPMCE EQAFALATEK LLGIDIPDRA
KWIRVMFAEI TRILNHILNL TALGLDCGAV TPALWGYEER EKLIEFYEAA SGARFHANYF
RPGGVSRDLP AGLEDRIAEW ARQFPAWIDD LESLLTNNRI WKQRTVGIGI FTTEQALAWG
FSGPCLRASG VPWDLRRAQP YDNYDKVEFN IPVARQGDCY DRYLIRVAEM RESVRIVEQC
LAQMKPGPIK IQDHKITPPP RREMKRSMEA LIHHFKLFTE GYHVPPGATY TAVESPKGEF
GVYLVADGSN RPYRCKIRPT GFAHLQAIDE MSRRHMLADA VAIIGSLDLV FGEIDR