Gene Dshi_1274 details


Gene Information

Locus tag: Dshi_1274
Symbol:
ID: 5711832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1324124
End bp: 1325101
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 64%
IMG OID: 641267186
Product: periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_001532617
Protein GI: 159043823
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
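
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon, which is the usual convention for a CDS but is not stated in the record itself. The variable names are illustrative.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1324124
end_bp = 1325101

# Inclusive coordinates span (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 978 bp
print(protein_length, "aa")  # 325 aa
```

Both values agree with the Gene Length and Protein Length fields listed in the record.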


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.031802
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.351389
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCGG ACATAGTCTG GGAGGACGAC ATGAAAACCA TCCTGAAATC GGTGGCGCTT 
GGTGCCGCAC TGGTCGCGGC ACCTTTCGCA TCCTTCGCGC AGAGTGACGA TGATCTGAAT
TACGTGCTGG TCAGCCATGC ACCCGACAGT GACACCTGGT GGAACACCAT CAAGAACGGC
ATCGCGCTGG CGGGCGAGCA GATGGGCGTG TCGGTCGAAT ACCGCAACCC GCCCACCGGT
GACATTGCCG ACATGGCGCG AATCATCGAG CAGGCCGCGG CCTCCGCGCC CGATGGCATC
ATCACCACGC TGGCGGATTT CGACGTGCTG CAAGGGCCGA TCAAGAACGC GGTCGATCAG
GGCATCGATG TCATCATCAT GAATACCGGC ACACCCGAAC AGGCCCGCGA GATCGGCGCC
CTGATGTATG TCGGCCAGCC CGAGTACGAC GCGGGCTTCG CCGCCGGGCA GCGCGCCAAG
GGCGAGGGGG TCACCAAGTT TCTTTGCGTG AACCACGCGA TCCAGCAGCC CACCGTGGGC
GAGCGCTGCC GCGGCTATGC CGACGGGCTC GGGATCGAGC TGGGCGATGC GATGATGGAC
AGCGGCACCG ACCCCGCCGA GATCAAGAAC AAGGTCATGG CCTACCTGTC CACGAATGAA
GACGTCGATG GCATCCTGAC CCTCGGCCCG GTCTCGGCGG ACCCGACCAT CGCGGCGCTG
AACGAGATGG GCCTGGCGGG CGAAATCCAT TTCGGCACCT TCGATCTGGG CGAGGAAATC
GTGAAGGCGA TCAAGGACGG CACCATCAAC TGGGGCATCG ACCAGCAGCC CTTCCTGCAG
GCCTACATGC CGGTGGTGAT CCTGGCCAAC TGGGACCGCT ACGGGGTTTT GCCGGGCAAC
AACATCAACT CCGGCCCAGG CTTCGTGACC GCCTCCGGTC TGGAGAAGGT CGAGGCCTTC
GCGGGCGAGT ACCGCTAA
 
Protein sequence
MRADIVWEDD MKTILKSVAL GAALVAAPFA SFAQSDDDLN YVLVSHAPDS DTWWNTIKNG 
IALAGEQMGV SVEYRNPPTG DIADMARIIE QAAASAPDGI ITTLADFDVL QGPIKNAVDQ
GIDVIIMNTG TPEQAREIGA LMYVGQPEYD AGFAAGQRAK GEGVTKFLCV NHAIQQPTVG
ERCRGYADGL GIELGDAMMD SGTDPAEIKN KVMAYLSTNE DVDGILTLGP VSADPTIAAL
NEMGLAGEIH FGTFDLGEEI VKAIKDGTIN WGIDQQPFLQ AYMPVVILAN WDRYGVLPGN
NINSGPGFVT ASGLEKVEAF AGEYR