Gene Dshi_2253 details

Gene Information

Locus tag: Dshi_2253
Symbol:
ID: 5713906
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2373608
End bp: 2374894
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 65%
IMG OID: 641268175
Product: putative capsule polysaccharide export protein
Protein accession: YP_001533590
Protein GI: 159044796
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3562] Capsule polysaccharide export protein
TIGRFAM ID:
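
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2373608
end_bp = 2374894
gene_length_bp = 1287
protein_length_aa = 428

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Split the span into codons; a remainder of 0 means a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)
# prints: 1287 429 0 428
```

Under these assumptions the listed gene length (1287 bp) and protein length (428 aa) agree with the coordinates.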


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.724395
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGAT CGCCCGCGCG GCGCTTTCTT TTCCTGCAAG GTCCCCACGG GCCCTTCTTC 
ACGCAACTCG CCGCGATGCT GACCCGCGCA GGCGCGAGCT GTTGGCGTGT GGGATTCAAT
GCCGGGGATG CGCGGTTCTG GAAAGACAAG CCCCGCTACA TCCCGTTTAC CGGACCGCGC
GCCGACTGGC CGCAGACCTG CGCCCAACTG CTGCGCAGCA AGGCGATCAC GGACATCGTG
CTCTACGGCG ACACCCGGTG GATTCACCGC ACGGCAGTGG AAGCCGCCCG GCAGCATGGT
TGCCGCATCC ATGTGTTCGA GGAGGGCTAT CTGCGCCCCT CCTGGGTCAC CTATGAGCGG
GGCGGCTCCA ACGGGCACTC GCCTCTGATG GACACCACGA TTCCTGAAAT GACGCAGGCC
CTGCGCGGCC ATGCGCCCGA GTTGCCGGGT GCCCCGGCGC GCTGGGGCGA TATTCGCCAG
CACATGTTCT ATGGCGCGGT CTATCACTTT CATGTCCTGT TCCGGAATTC GGACTACACC
GCGTTCAAGC CCCACCGCGC CCTGACCGTG CGGCAGGAAT TCCTGCTCTA CCTGAAGATG
CTGCTGCGTC TGCCACTGAC CATGGTGGAA CGGGCGCTTG CAACGCGCCG GATCCGGCGA
GGTGGGTTCC CCTATCACCT TTTCCTGCTG CAACTCGAAC ATGACAGCGC ATTTCAGGCC
CACAGCCCGT TCGCATCCAT GACCGAGGTG CTCGCCGATG TCATCGGCGC CTTTGCGCGC
GCAGCCCCCG CGCATCACCA CCTCGTCTTC AAGGCGCATC CGCTCGAGGA TGGTCGCGCG
CCTCTGCCGG CGGTAATCAA GCGCCTGTCC CGCGAGACCG GCATCGCCGA TCGCGTGCAT
TACGTGAAGG GTGGCAAACT TGCCCGCCTG CTCGACCAGG CGCGCGCAGT TCTGACGGTC
AATTCCACGG CTGCACAGCA GGCCTTGTGG CGCGGCCTGC CGGTCAAGAC CCTGGGCCAG
GCAGTCTATG CCAAGCCGGA GCTTGTCTCG GCCCAACCCC TGGAGGCGTT TTTTGCGGAC
CCGATCCCGC CCGACATGCG CGCTTATCGC GACTTCCGGC ACTACCTGCT GGAAAGCAGC
CAAGTGGCCG GAGGGTTCTA CTCGGCGACC GGGCGGCAAC ACCTTCTACG GCAGGTCACG
GACATGGTGT TGTCCGACCA GGACCCCTAT GGAGCCCTCA AGATCGGCGC ACCACGGCGA
CGCTTCCCGA TACGCGTTGT GAAATAA
 
Protein sequence
MTGSPARRFL FLQGPHGPFF TQLAAMLTRA GASCWRVGFN AGDARFWKDK PRYIPFTGPR 
ADWPQTCAQL LRSKAITDIV LYGDTRWIHR TAVEAARQHG CRIHVFEEGY LRPSWVTYER
GGSNGHSPLM DTTIPEMTQA LRGHAPELPG APARWGDIRQ HMFYGAVYHF HVLFRNSDYT
AFKPHRALTV RQEFLLYLKM LLRLPLTMVE RALATRRIRR GGFPYHLFLL QLEHDSAFQA
HSPFASMTEV LADVIGAFAR AAPAHHHLVF KAHPLEDGRA PLPAVIKRLS RETGIADRVH
YVKGGKLARL LDQARAVLTV NSTAAQQALW RGLPVKTLGQ AVYAKPELVS AQPLEAFFAD
PIPPDMRAYR DFRHYLLESS QVAGGFYSAT GRQHLLRQVT DMVLSDQDPY GALKIGAPRR
RFPIRVVK
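
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the coding sequence is read in frame from its first base and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which this sketch does not model). The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGACCGGAT CGCCCGCGCG GCGCTTTCTT TTCCTGCAAG GTCCCCACGG GCCCTTCTTC"
last_line = "CGCTTCCCGA TACGCGTTGT GAAATAA"

print(translate(first_line))  # MTGSPARRFLFLQGPHGPFF
print(translate(last_line))   # RFPIRVVK
```

The two printed fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.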