Gene Gdia_1425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1425 
Symbol 
ID6974834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1586858 
End bp1587988 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content71% 
IMG OID643390956 
Productputative L-sorbosone dehydrogenase 
Protein accessionYP_002275820 
Protein GI209543591 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTTCC GCCGGACCAT GTCGATGCTT GTCCTTCTCG GGGCGATGCT GGCCGCCGCC 
GGGACACGGG CCGCGCCGCC GGTCGCGCGG CTGCGCCTGC CGCCGGGATT CCATGTCTCG
GTCTATACCG ACCAGGTTCC TTCGGCGCGG GAGATGGCGA TCGGCGCCCG GGGCACGCTG
TTCGTGGGCT CGATGACGGC GGGCGCGGTC TATGCCGTGA CCGATGACGG GCCGGGACGG
GGCCGCCGGG TGCGGGTCGT GGCGCGCGGG CTGACCATGC CGGTGGGCGT GGCCTTCCGG
GATGGCGACC TGTACATCTC CGACGTGCGC GACATCGTGG TCCTGCGCGG GATCGAGGAC
CGGCTGGACC ACCCGCCCGC GCCGCAGGTC GCCGTGCCGG ACCTGCCCTG GCGGGTGGGC
GACCATGGCT GGAAATTCAT CGCTTTCGGC CCGGACAGCA AGCTGTATGT GCCGATCGGC
GCGCCGTGCA ATATCTGCGA CGTCGGGCAC CGGTTCGGCC GGCTGATGCG CATGAATCCC
GACGGCACGG GGCGCGAGGA CGTGGCCTAC GGCCTGCGCA ACAGCGTGGG CTTCACGTGG
CAGCCGGGCC AACCGGGGCA GCCGGGGGCC GGCACGCTGT GGTTCACCGA TAACGGACGC
GACCTGATGG GCGACGACGT GCCCAGCGAC GAGCTGAACC GGGTGGACCA TGCCGGCCAG
TCCTTCGGCT ATCCCTATTG TCATCAGGGC GACGTGCCGG ACCCCGTCTT CGGGCGGGGC
CATCCGTGTT CCGACTTCAC GCCGCCGGTG CTCAAGCTGG GCGCGCATGT CGCGGCCCTG
GGCCTGCGCT TCTATACCGG CAGCCAGTTT CCCGCGGCGT GGCGCGGCGC CCTGCTGATC
GCCGAACACG GGTCGTGGAA TCGCAGCCGG CTGGCGGGTT ATCGCGTCAT GGCGGTGCGC
TTCGGCCCGG ATGGGGGTAT CGCGTCCTAT GTGCCGCTGA TCGACGGGTT CCAGCAGGAT
GAAACCCCGT GGGGCCGCCC CGCCGACGTG CAGCCCCTGC CGGACGGCAG CGTGCTGGTC
AGCGACGACC TGGCCGGCGC GATCTATCGC GTGACCTATG GCAGGGACTG A
 
Protein sequence
MPFRRTMSML VLLGAMLAAA GTRAAPPVAR LRLPPGFHVS VYTDQVPSAR EMAIGARGTL 
FVGSMTAGAV YAVTDDGPGR GRRVRVVARG LTMPVGVAFR DGDLYISDVR DIVVLRGIED
RLDHPPAPQV AVPDLPWRVG DHGWKFIAFG PDSKLYVPIG APCNICDVGH RFGRLMRMNP
DGTGREDVAY GLRNSVGFTW QPGQPGQPGA GTLWFTDNGR DLMGDDVPSD ELNRVDHAGQ
SFGYPYCHQG DVPDPVFGRG HPCSDFTPPV LKLGAHVAAL GLRFYTGSQF PAAWRGALLI
AEHGSWNRSR LAGYRVMAVR FGPDGGIASY VPLIDGFQQD ETPWGRPADV QPLPDGSVLV
SDDLAGAIYR VTYGRD