Gene Gdia_3333 details


Gene Information

Locus tag: Gdia_3333
Symbol:
ID: 6976776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3644488
End bp: 3645813
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 66%
IMG OID: 643392847
Product: NADH dehydrogenase I subunit F
Protein accession: YP_002277675
Protein GI: 209545446
COG category: [C] Energy production and conversion
COG ID: [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
TIGRFAM ID: [TIGR01959] NADH-quinone oxidoreductase, F subunit
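
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3644488
end_bp = 3645813
protein_length_aa = 441

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)           # 1326
print(expected_protein_length)  # 441
```

Under these assumptions, the computed values agree with the reported gene length (1326 bp) and protein length (441 aa).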


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.240845
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.194232
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTCAGG ACAAGGACCG GATCTTCACC AACCTGTATG GTCAGCAGGA CTGGCGCCTG 
GCCGGCGCCC GGGCGCGCGG CGACTGGGAC GGCACGGCCG AGATCATCGC CCGCGGGCGC
GATGCGATCA TCAACGAGAT GAAGGCGTCC GGCCTGCGCG GCCGGGGCGG CGCGGGCTTC
CCCACCGGCG TGAAATGGTC GTTCATGCCC AAGAATTCGG ACGGCCGTCC TCATTACCTG
GTGATCAACG GCGACGAATC CGAACCCGGC ACCTGCAAGG ATCGCGAGAT CCTGCGCCAT
GACCCGCACA AGCTGATCGA GAGCGCGCTG ATCGCCTCGT TCGCCATGGG GGCGCATGTC
GCCTACATCT ACATTCGCGG CGAATTCTTC AACGAAGCCC GGCATCTGCA GATCGCGATC
GACGAGGCCT ATGCCGCCGG CCTGATCGGC CAGAACGCCG CCGGCTCGGG CTGGGATTTC
GATTTCTACA TTCATCGCGG CGCCGGCGCC TATATCTGCG GCGAGGAAAC CGCGCTGCTG
GAAAGCCTGG AAGGCAAGAA GGGCCAGCCC CGGATGAAGC CGCCTTTCCC GGCGGCAATG
GGCCTGTATG GCTGCCCGAC CACGGTGAAC AACGTGGAAA GCATTGCCGT CGCCGCGACC
ATCCTGCGGC GCGGGGCTAC GTGGTTCTCG TCGCTGGGAC GCCCGAACAA CGCGGGCACC
AAGCTGATGG CCATATCGGG CCACGTGAAC ACGCCCTGCG TGTTCGAGGA AGAACTCGGC
GTTCCGCTGA AGGACATCAT CGAAAAGCAT GGCGGTGGCG TGCGCGGCGG CTGGGACAAC
CTGCTGGCGG TGATCCCGGG CGGCTCTTCG GTCCCGCTGC TGCCGGCCTC GGTCTGCGAG
ACCGTGCTGA TGGATTACGA CAGCCTGCGG GCCGAACGGT CGGGCCTGGG CACGGCATGC
ATGATCGTCA TGGACAAATC CACCGACGTC ATCCGCGCCA TCGCCCGCTT CTCGCAATTC
TACAAGCATG AAAGCTGCGG CCAGTGCACG CCCTGCCGCG AAGGCACGGG CTGGATGATG
CGGGTCATGC ACCGCATGGT CGAAGGCCGG GCCGAAATCG AGGAAATCGA CATGCTCGAA
CAGGTGACGC GCCAGGTCGA AGGGCACACG ATCTGTGCCC TCGGCGATGC GGCGGCATGG
CCGATCCAGG GCCTGATCCG GCATTTCCGC CCGGTGATGG AGGAGCGGAT CCGGGCCTAC
AAGGCCACCC ACGGCGGCGG TCAGATCACG CAGGTGCCCC CGGGTATCCC CGTGGCGGCG
GAGTAA
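
The gene sequence is displayed in space-separated blocks of 10 bases, so the spacing has to be removed before the sequence can be measured. The sketch below shows one way to do that and to compute a GC fraction in Python. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how this database rounds its reported GC content is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied as shown.
first_line = "ATGCTTCAGG ACAAGGACCG GATCTTCACC AACCTGTATG GTCAGCAGGA CTGGCGCCTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions gives a length and GC fraction that can be compared with the Gene Length and GC content fields.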
 
Protein sequence
MLQDKDRIFT NLYGQQDWRL AGARARGDWD GTAEIIARGR DAIINEMKAS GLRGRGGAGF 
PTGVKWSFMP KNSDGRPHYL VINGDESEPG TCKDREILRH DPHKLIESAL IASFAMGAHV
AYIYIRGEFF NEARHLQIAI DEAYAAGLIG QNAAGSGWDF DFYIHRGAGA YICGEETALL
ESLEGKKGQP RMKPPFPAAM GLYGCPTTVN NVESIAVAAT ILRRGATWFS SLGRPNNAGT
KLMAISGHVN TPCVFEEELG VPLKDIIEKH GGGVRGGWDN LLAVIPGGSS VPLLPASVCE
TVLMDYDSLR AERSGLGTAC MIVMDKSTDV IRAIARFSQF YKHESCGQCT PCREGTGWMM
RVMHRMVEGR AEIEEIDMLE QVTRQVEGHT ICALGDAAAW PIQGLIRHFR PVMEERIRAY
KATHGGGQIT QVPPGIPVAA E
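
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal Python sketch of that translation, assuming the reading frame starts at the first base and that translation stops at the first stop codon. The `translate` helper is hypothetical, and the sketch does not handle the alternative start codons that table 11 allows, since this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. NCBI translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGCTTCAGG ACAAGGACCG GATCTTCACC"))  # MLQDKDRIFT
```

The output matches the first block of the protein sequence. Translating the last 15 bases of the gene (GTGGCGGCGGAGTAA) in the same way yields VAAE, matching the end of the protein.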