Gene Dshi_2082 details


Gene Information

Locus tag: Dshi_2082
Symbol:
ID: 5713077
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2202111
End bp: 2203370
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 65%
IMG OID: 641268004
Product: putative sorbosone/glucose dehydrogenase
Protein accession: YP_001533420
Protein GI: 159044626
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
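
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2202111
END_BP = 2203370
GENE_LENGTH_BP = 1260
PROTEIN_LENGTH_AA = 419


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```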


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCAC TACTCGGGTC CGTGGCGCTG TGCGTCGCGG CGACCGGCGT CGCGCACGCG 
CAGGACAACA TGGAAAAATT GTCCAACATG CAGAAAACCG GCGCGACATT CACCTTCATC
GATCAGGGCG GCGACCGGGC CGAGGCGCTG CGCAACATCA TCCAGCACAT CAACGTGCCC
GACGGCTTCG AGGTCAGCCT CTATGCGGTT GTGCCCGATG CCCGCTCGAT GTCCATGGCG
CCCCAGGGCA CCGTCCTGTT CGCCGGCACG CGCAAGGACA AGGTCTGGTC CATCGTCGAC
CGGGATCGCG ACAGGGTCGC CGATGAGGTC AAGGACTTCG CGCCTTCGGT CACCTTCGAC
ATCCCCAACG GGCCGTGTTT CTCGCCGGAC GGGTTCCTGT ACATCGCCGA GCGGAACCGC
ATCCTGGTCT TCCCGGCTGC CGAGTTCTTC TTCGAGAGTC CGGATATCGC CGTGGGCACC
GTGGTGCCGC AGGGCGAGTT GATCCCGGTC GAGGAAGAGA GCTTCAACCA CTCCGCCCGG
GTGTGCGACA TCGGCCCGGA CGGCAAGCTT TATGTCTCGC TCGGCCAGCC GCACAACGTG
CAGCCGCTGG ACAAGATCGA GATGTATGAC GAGATCGGCA TCGGCGGCAT CATCCGGATG
AACACCGACG GCTCAGAGCG CGAGGTCTAT ACCCGCGGCG TGCGCAACTC GGTCGGGCAG
GATTTCAACC CGGCGACGGG TGAGTTGTGG TGGACCGACA ACCAGGTCGA CGGGATGGGC
GACGATATCC CGCCGGGCGA GTTGAACCGG CAGACCGAGG CGGGTCAGCA TTTCGGCTTC
CCCTGGACCA ATGCCCGGGT CGAGATCGTC TCGGAGGCGG ATTTCCCCCG GCCCGAGGGG
GTGACCTTTG TCGAGCCGCA ACTTGAGCTG ACCGCCCATG CGGCGGATCT GGGCATGCGG
TTCTACCACG ACAGCAGCTT CCCCGAGGCG TATCATGGGG GCATCTTCTG GGCGCAGCAC
GGGTCCTGGA ACCGCACCAC GCCGGTGGGC GCCCGGGTGA TGTTCACTGC CCTCGATCCC
GAAACCGGGG ATGCGGTGGG CGCGGAGGTA TTCGCCGATG GCTGGCTGAA CGAAGAGACC
GGCGAGTATC GCGGGCGTCC GATGGACATC GAATTCCTGC CCGATGGCTC GATGCTGGTC
TCGGATGACT TCGCCGGGGC GATCTGGCGG ATCGCCTATG TTGGGATGCC CGCCGAATGA
 
Protein sequence
MKSLLGSVAL CVAATGVAHA QDNMEKLSNM QKTGATFTFI DQGGDRAEAL RNIIQHINVP 
DGFEVSLYAV VPDARSMSMA PQGTVLFAGT RKDKVWSIVD RDRDRVADEV KDFAPSVTFD
IPNGPCFSPD GFLYIAERNR ILVFPAAEFF FESPDIAVGT VVPQGELIPV EEESFNHSAR
VCDIGPDGKL YVSLGQPHNV QPLDKIEMYD EIGIGGIIRM NTDGSEREVY TRGVRNSVGQ
DFNPATGELW WTDNQVDGMG DDIPPGELNR QTEAGQHFGF PWTNARVEIV SEADFPRPEG
VTFVEPQLEL TAHAADLGMR FYHDSSFPEA YHGGIFWAQH GSWNRTTPVG ARVMFTALDP
ETGDAVGAEV FADGWLNEET GEYRGRPMDI EFLPDGSMLV SDDFAGAIWR IAYVGMPAE
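
The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. The sketch below is illustrative only: `translate` and `gc_content` are hypothetical helper names, and the codon table is written out by hand rather than taken from a library. It relies on the fact that translation table 11 assigns the same amino acids to sense codons as the standard code; the tables differ in which start codons are permitted, which does not matter here because the gene begins with ATG. Whitespace is stripped so the sequence can be pasted in the 10-base blocks used above.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First seven codons of the gene sequence above.
    print(translate("ATGAAATCAC TACTCGGGTC C"))
```

Applied to the full gene sequence, `translate` should give the protein sequence followed by a single `*` for the terminal TGA stop codon, and `gc_content` can be compared against the GC content field reported in the record.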