Gene Dshi_1421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1421 
Symbol 
ID5712598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1473533 
End bp1474705 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content63% 
IMG OID641267334 
Productribose ABC transporter 
Protein accessionYP_001532764 
Protein GI159043970 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.194337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATGC GAATTTTGGC ATCTTTGGGG TTGGTGTCGA CCTTGATGAT AAACACTGCC 
GCGATGGCAG AGGGCCTCGA AGAGTTGCCG CCGAAGCTTC AGGCCGCCTA TCAGGGGGTC
GACGAGGGCC AGCCGATCGG GACGTCGGCC TATCGCGACT GGACGCCTCG GTCCGGTCCG
CCCTGGACCA TCGGCTATGC AAGCTCCTAT GCCGGCAACA CGTGGCGCGC AGAGGGCTTG
AGCCGTCTGA CCGAGGATTT GCTGCCGGTT TACAGGCAGG CGGGGCTGGT TGACGAGATC
ATCGTCACGC AATCGGATCT CAACGACGCG CGCCAGATCC AGCAGATCCG ACAGCTTGTG
GACCAGGGCG TGGATGCGAT CATCGTGTGC TGCTCGAACC CGGTCGCCCT GAACAAGGCC
GTCGAATATG CCTACTCCAA GGGTGTCGTG GTGTTTTCCT ATTCGGGCTA TCTGACGTCG
GACAAGGCGC TGAACGCCTC GTCGAACTAT ACGCTGGGCG GCTATGAAAT CGCCAAGGCG
ATGATCGAGG AAGTGGGCGG CGAGGGGAAC TTCCTGCTGG TGTCGGGGAT CGCGGGCGCG
GCCTCTTCGG AGAGCTTCGA CACCGGCGCC ATGCGCGCCT TGGAGGAGTT TCCGAACGCC
AAGCTGGTTG GCCAGGTCTG GGGCAACTGG ACCGACCAAG TCGCCCAGAC CGAGGTTCAG
AAGTTCCTCG CAACCAACCC CGCGCGGATT GACGGGATCA TCGCGCAGGG CTCCCAGGAA
ACCGGTGTGC TGAAGGCGGT GTTGCAGTCG GGCCGCGAGG TGATGCCGAT CTCGCTGGCA
GGCTCGGCCG GAGCGGCTTG CTATCTAAAG CAGAACCCCG ATTGGATCAG CCATGCGTTC
CAGATCTGGC CCCCGGGCGA CGAGATGGAA CTGGGCTTCA ACTCGGTGAT CCGCACGCTG
CAGAACCAGG GTCCCAAACT GCAATCGATC CTGCGCGGGG TCTACCGGCT GCCCGCGGCG
GAATACGTGG CGAGCCTGGG CGATGACTGC TCGGTCGACT CGACCGCGTA CATCCAGCCG
GGCATCGACG TCTGGTTCCC CGACGACAAG GCCGCCGGCT ACTTCCTGCG GCCGGAAAAC
CCGCTCGATT GGGCCGCCAA GAACGTCAAC TGA
 
Protein sequence
MRMRILASLG LVSTLMINTA AMAEGLEELP PKLQAAYQGV DEGQPIGTSA YRDWTPRSGP 
PWTIGYASSY AGNTWRAEGL SRLTEDLLPV YRQAGLVDEI IVTQSDLNDA RQIQQIRQLV
DQGVDAIIVC CSNPVALNKA VEYAYSKGVV VFSYSGYLTS DKALNASSNY TLGGYEIAKA
MIEEVGGEGN FLLVSGIAGA ASSESFDTGA MRALEEFPNA KLVGQVWGNW TDQVAQTEVQ
KFLATNPARI DGIIAQGSQE TGVLKAVLQS GREVMPISLA GSAGAACYLK QNPDWISHAF
QIWPPGDEME LGFNSVIRTL QNQGPKLQSI LRGVYRLPAA EYVASLGDDC SVDSTAYIQP
GIDVWFPDDK AAGYFLRPEN PLDWAAKNVN