Gene Dshi_1872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1872 
Symbol 
ID5712864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1951375 
End bp1952643 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content67% 
IMG OID641267796 
Productputative soluble aldose sugar dehydrogenase 
Protein accessionYP_001533215 
Protein GI159044421 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.413591 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.509919 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGAC TGTCTGCCCT GTCGCTCGGC GCGGCACTGG CCTCTGCAAC CGCGCTTGCA 
GCTCCGGCCT TCGCGGCCAA CACCACGTCC CTGAGCCACG AGATTGTGCT CGAAGGGCTC
GAGAATCCGT GGGACGTCGC CTTTCTCGAA GACGGCACGA TGTTCTTCAC CGAGAAATGC
CTCGGCCTGT CCGTGCGCCT GCCCGATGGC TCGGTCAACA AGCTCTTGGG CATGAAGGGC
ACCGACGACG ACTATGCCTC CACCGCCGAG GACCTGTTTT GCGAAGGCCA GGCCGGGATG
CAGGGCGTCG CGGTTGACCC GGACTTCGCC GAGAATCGGC AGATCTATGT CTATTCGACC
TCCGACCTGA CCGCGCCCGG CTCCAACCGC CTGTTGCGGA TGACCGTGGG CGAGGATCTC
GCCAGCGTGG CGGACCGCAC CGACATCGTC GAGGACGTGC CCTACAAGCC CGCCGCGACC
GACCACCCCT TCGGTGGCCC CGGCGCCCAT AACGGCGGCC GCGTGCGCTT CGGCCCGGAC
GGGTTCATCT ACCTGACCAC CGGCGACACC CATAACGGCG AAGGCCCGCA AAGCCCGACC
CTGCTGGCGG GCAAGGTGCT GCGCATCGAC CGCGACGGCA ATGCCGCCGA GGGCAACGCG
CCCCCCGAAG GCTTCGATCC GCGGATCTAC ACCTACGGGC ACCGCAACAC CCAGGGCATC
ACCTTCCACC CGGAAACCGG TGCTGCCATC ACCGCCGAGC ACGGCCCCTG GCACTCCGAC
GAGATCACCG TTCTGCAGAA CGGCGGCAAT GCCGGCTGGG ATCCGCGCCC GAACGTGGGC
GGCCGAGGCG AATGCCCGGA TGGCTACTGC GGCTACTCCC CGAACCAGAT GGAGGGCATG
GACCGCTACG AGCGCGCGGC CTTCATGCCG ATGACCGATC TGGCAACCTA TCCGGACGCG
ATGCAGCCGA TCTGGGACAA TAACGGCTGG TCCCAGGGCA CGTCCTCTGC CGAGTTCCTG
ACCGGGGACC AGTGGGGCGA CTGGGAAAAC CACCTGGTCG TCGGCATCAT GGGCATTGGC
TTCGGCGGCA CGCCCATCGG CCAGCGCATC GACGTGATCG AGCTCAACGA GGCGGGCACC
GAGGTGGTCG ACGTCACCGA GATGACCCTG CCGATGGAGC CCGGTCGCTT CCGCTCCCTC
GTCCAGGGCC CCGATGGCGC GCTCTACGCG GTGGTGGATC AGGGGATGAT TCACAAGATG
ATGCCGTAA
 
Protein sequence
MTRLSALSLG AALASATALA APAFAANTTS LSHEIVLEGL ENPWDVAFLE DGTMFFTEKC 
LGLSVRLPDG SVNKLLGMKG TDDDYASTAE DLFCEGQAGM QGVAVDPDFA ENRQIYVYST
SDLTAPGSNR LLRMTVGEDL ASVADRTDIV EDVPYKPAAT DHPFGGPGAH NGGRVRFGPD
GFIYLTTGDT HNGEGPQSPT LLAGKVLRID RDGNAAEGNA PPEGFDPRIY TYGHRNTQGI
TFHPETGAAI TAEHGPWHSD EITVLQNGGN AGWDPRPNVG GRGECPDGYC GYSPNQMEGM
DRYERAAFMP MTDLATYPDA MQPIWDNNGW SQGTSSAEFL TGDQWGDWEN HLVVGIMGIG
FGGTPIGQRI DVIELNEAGT EVVDVTEMTL PMEPGRFRSL VQGPDGALYA VVDQGMIHKM
MP