Gene Dshi_3141 details


Gene Information

Locus tag: Dshi_3141
Symbol: ugpC3
ID: 5712197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 3307792
End bp: 3308850
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 65%
IMG OID: 641269068
Product: sugar ABC transporter
Protein accession: YP_001534475
Protein GI: 159045681
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
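
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon, and the function names are hypothetical rather than part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon in the CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(3307792, 3308850)
print(length)                  # 1059
print(protein_length(length))  # 352
```

Both results agree with the Gene Length and Protein Length fields in the record.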


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.204703
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGAGG TGATCCTCAA GGACCTGACC AAGCGGTGGG GCGATTTCGT CGGGGTGGAC 
AACCAGTCCC TGCATGTCCG CGACGAGGAA TTCCTGGTGC TGCTGGGCCC CTCGGGCTGC
GGCAAGACCA CGACCATGCG GATGATCGCC GGGCTGGAGG ATCCGACCGA TGGCGAGATC
TGGATCGGCG ACCGAATGGT CAACGACGAC CTGCCCAAGG ACCGCGACGT GGCCATGGTG
TTCCAGAATT ACGGTCTCTA TCCGCATATG ACGATCTTCG AGAACATCGC CTATCCCCTG
CGGGTGCGCG GCGTCGACAA GGCCGAGATT CCGCCGCGGG TCCAAAGGGC CGCCGAGCAG
GTGGAACTGA CCAAGTTCCT CCACCGCAAG CCCAAGGCGC TCTCCGGCGG GCAGCGGCAG
CGCGTGGCCC TGGCCCGCGC CATCGTGCGC AAGCCCAAGG TCTTCCTGAT GGACGAGCCG
CTGTCGAACC TCGACGCCAA GCTGCGCGTC ACCATGCGGG CGGAGCTGAA ACATCTCAGC
CGCGAGTTGC AGATCACCAC CGTCTACGTG ACCCACGACC AGATCGAGGC GATGACGCTG
GCCGACCGGG TCGCGGTGAT GAAGCACGGC GTGATTCAGC AACTCGGCAC CCCGGACGAG
ATCTACAACG ACCCCGCGAA CCTCTTCGTG GCGGGCTTCA TCGGCTCGCC CGCCATGAAC
CTGATCAACG GCTCGGTCGA GGACGGCATG TTCGTGACCA CCGGTGGCAC CCGGCTGGTC
AAGGTGCCCT CCCCGGACCG GGCGCGCGCG ATCCTCGGGG TGCGCGCCGA CGACATGCAG
GTCCACGAAG CCGGGCAGGG CGATATCGAC GTGACCATCT ATGCCTTCGA GAATACCGGC
GAGAGCACCC TTCTGACCGT GCAATGGGGC AAGCAGCGGG TGATCGCCCG CGGTGACCGG
CACCTGCGCA AGGAACAGGA CGATGTCGTC GGCATCAGCC TGAACACCGA CCATTTGTAC
CTCTTCGATC CGGACACCGA AGAGCGCATC AGGATGTAG
 
Protein sequence
MAEVILKDLT KRWGDFVGVD NQSLHVRDEE FLVLLGPSGC GKTTTMRMIA GLEDPTDGEI 
WIGDRMVNDD LPKDRDVAMV FQNYGLYPHM TIFENIAYPL RVRGVDKAEI PPRVQRAAEQ
VELTKFLHRK PKALSGGQRQ RVALARAIVR KPKVFLMDEP LSNLDAKLRV TMRAELKHLS
RELQITTVYV THDQIEAMTL ADRVAVMKHG VIQQLGTPDE IYNDPANLFV AGFIGSPAMN
LINGSVEDGM FVTTGGTRLV KVPSPDRARA ILGVRADDMQ VHEAGQGDID VTIYAFENTG
ESTLLTVQWG KQRVIARGDR HLRKEQDDVV GISLNTDHLY LFDPDTEERI RM
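
The sequences above are shown in blocks of 10 residues, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction; the helper names are hypothetical, and only the first line of the gene sequence is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGCCGAGG TGATCCTCAA GGACCTGACC AAGCGGTGGG GCGATTTCGT CGGGGTGGAC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this first line only
```

Applying the same two functions to the full gene sequence should give a value that can be compared with the GC content field in the record.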