Gene Gdia_2839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2839 
Symbol 
ID6976270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3103354 
End bp3104460 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content66% 
IMG OID643392346 
Productoxidoreductase domain protein 
Protein accessionYP_002277185 
Protein GI209544956 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGCGA GGCGTTTCCG CGTTGGTGTT GTAGGCCTCC AGCCCGGCAG GAGTTGGGCC 
GCGCGCGCTC ATGTGCCGGC GCTGCGCGCG CTATCTGACA GCTTCGAGAT TGCGGGCGTC
GCCAACACCA GCTTGGAGAG CGCCGAGCGT GCCGCTGCTG AAATCGGTCT GCCGAGAGCC
TATGCGAATG TCGCCCAATT GGTGGCGGAC GCAGAGGTGG ACATCGTCGC CGTCACGGTG
AAGGTGCCGC ATCACCTCGA AATCTCCAGG GCGGCCATCG AAGCAGGCAA GCACGTCTAC
TGCGAATGGC CGCTCGGTAA CGGGCTCGCC GAGGCCGAGG AGATCGCAGC GCTCGCCAAG
GCGAAGGGCG TGCTGGGCGT GGTCGGCACG CAGGCCCGTG TCGCGCCCGA GATTCAGTAT
CTGAAGCAAT TGATAGCCGA AAGCTTCATC GGCGAGGTGC TGTCGACGAC GTTGATCGCG
CGCGGCGGCG GCTGGGGGGG CATCGTTCCG GACAAGAAAA ATGGCGCCTA CCTTCTCGAC
AAGGCCAGCG GCGCGACCAT GCTCACAATT CCGGTCGGGC ATACCCTTGC CGGATTGACG
GAGGTGCTGG GCCCAATCGC AGAGCTTTCT TCGGTATTGG CCACCCGCCG CACGACGGCG
GTCGTCGCGG GAACCGATGA AACGTTACCT GTCACCGCGG CCGACCAAGT GCTGGTTAAC
GGCGTCCTTG CCAGCGGCGC GCCCATTTCG GTCCACTACG TCGGCGGCAT GCCGCGAGAC
GGTGCGGGCC TGTTGTGGGA GATCAACGGC ACGCAGGGGG ATGTCCGGGT GAAGGGCCCC
CTCGGGCACG CCCAGCTCGT GCCCCTCACG CTCGAGGCCG CACGCGGTGA CGAGAAAGCG
TTCCAGCGGC TGGAGGTTCC CGCTTCATAT CTCGATGGCC TCCCGTCGGA CCCCGCGCTT
GGAAACGTCG CGCGCAACTA TGCGCGCATG GCCCGTGATC TGCGCGACGG CGCCCACACA
GCGCCAACAT TCGACGATGC TGTCGTGCTC CACCGCGTCA TCGCGGCGAT CGAGACGGCC
GCGACACACT CGATAAAGCC AGCCTGA
 
Protein sequence
MVARRFRVGV VGLQPGRSWA ARAHVPALRA LSDSFEIAGV ANTSLESAER AAAEIGLPRA 
YANVAQLVAD AEVDIVAVTV KVPHHLEISR AAIEAGKHVY CEWPLGNGLA EAEEIAALAK
AKGVLGVVGT QARVAPEIQY LKQLIAESFI GEVLSTTLIA RGGGWGGIVP DKKNGAYLLD
KASGATMLTI PVGHTLAGLT EVLGPIAELS SVLATRRTTA VVAGTDETLP VTAADQVLVN
GVLASGAPIS VHYVGGMPRD GAGLLWEING TQGDVRVKGP LGHAQLVPLT LEAARGDEKA
FQRLEVPASY LDGLPSDPAL GNVARNYARM ARDLRDGAHT APTFDDAVVL HRVIAAIETA
ATHSIKPA