Gene Dshi_4076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_4076 
Symbol 
ID5714621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009958 
Strand
Start bp9370 
End bp10845 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content68% 
IMG OID641276983 
Productcapsule polysaccharide export protein 
Protein accessionYP_001542279 
Protein GI159046610 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3524] Capsule polysaccharide export protein 
TIGRFAM ID[TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.054878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACACAGC CCAAACCCTC CCCCCAGGAC AGGGGCCCCT CCGCCCCCGA GCCCAAGCCC 
GGTGCACCCG ACAGGGCCGC GCAGCAGGGC GCCGCGGCCT CGAAGGCGCC GGGGTCGGGC
GGTCAGAACA GACCCGGCGG TCCGGACGGG GCAAGCCCGA AGGCGCGGCC TCAACCCGTC
GTCCAGGCCT CGAAAGAGAC TGCTAAAAAG ACCGGAAAAG ACACTTTAAA GGACACGGGG
CACAAGACCG CCGCCCCGCA GCCCGCCCAG AAACCGGCCC AGAAACCCAC ACCGAAACCC
GCGCCTACAA AGGGCCAGGC CGAAGCGGCG CAGATCCGCA TCGCCCCACC GGTTCCCCCG
GCCCGCCGCC GTCCTCATCA CAAGTTGCTG ATGATCAGCT TCGCGATCAT GGTGGTCGTC
CCGATCCTGG TCTCGGCCTG GTATCTCTGG GCCCGGGCTC AGGACCAATA CGCGTCTTAC
GTGGGCTTTT CCGTGCGCAC CGAGGAGGTC GGCTCGGCCA TCGAGCTTCT GGGCGGGATC
ACCGAGCTGT CGGGCTCGTC GAGCTCGGAC ACCGATATTC TCTACAAGTT CCTGCAAAGC
CAGGAGCTGG TGGCGACGGT GGATGCCGCG CTCGACCTGC GCGGGATCTG GTCGAAGGCC
GACCCGGAGG TAGACCCGAT CTTCGCCTAT GATCCCCCCG GCACCATCGA GGATCTGCAG
GATCACTGGC TGCGCAAGGT GTCGATCTAT TACGACAGCG GCTCGGGACT CCTGGACCTG
CGGGTGCTGG CCTTCGATCC CGCCGACGCG CGCGCCATCG CCGAGATGAT CTTCGCCGAG
AGCAGCGCGC GCATCAACGC GCTCTCGGCG CTGGCCCGGG AAGACGCCAT CGCCTATGCC
CGCGACGAGC TCACCCAGGC CGAGACCCGT CTGCGCGACG CGCGCCTTGC GCTCAACGAG
TTCCGCAACC GCACACAGAT CGTCGATCCG ACCATCGACA CCCAGGGCCA GATGGGGCTG
GTCAACACGC TGCAGGCGCA GCTGGCCGAT GCCCTGATCG AGGTGGACCT GCTGCGCGAG
ACGACGCGCA CCGGCGACCC GCGGATCACG CAAGGCGAAC TGCGCATCGC GGTGATCGAA
CGCCGCATCG AGGAGGAACG CCGCAAGGTG GGCCTGGGCG GCGGGGTCTC GGGGGACCGG
TCGGTCTTTG CCGACCTGGT GGGCGAGTTC GAGCGGCTCT CGGTGGACCT GGAATTCGCC
CAGCAGAGCT ACGTGGCAGC GCTCGCCACC TTCGACGCGG CCCGCAACGA GGCCCGCCGC
CAGAGCCGGT ACCTGGCCGC CCATGTCCGC CCGACCCTGG CCGAGCGGGC CGAGTACCCC
CAACGGATCT ACCTGCTGGG CCTGATCGGG CTGTTTTCCT TCCTGGCCTG GACGATCACG
GCCCTCATCG CCTATTCCCT TCGAGACCGG CGCTGA
 
Protein sequence
MTQPKPSPQD RGPSAPEPKP GAPDRAAQQG AAASKAPGSG GQNRPGGPDG ASPKARPQPV 
VQASKETAKK TGKDTLKDTG HKTAAPQPAQ KPAQKPTPKP APTKGQAEAA QIRIAPPVPP
ARRRPHHKLL MISFAIMVVV PILVSAWYLW ARAQDQYASY VGFSVRTEEV GSAIELLGGI
TELSGSSSSD TDILYKFLQS QELVATVDAA LDLRGIWSKA DPEVDPIFAY DPPGTIEDLQ
DHWLRKVSIY YDSGSGLLDL RVLAFDPADA RAIAEMIFAE SSARINALSA LAREDAIAYA
RDELTQAETR LRDARLALNE FRNRTQIVDP TIDTQGQMGL VNTLQAQLAD ALIEVDLLRE
TTRTGDPRIT QGELRIAVIE RRIEEERRKV GLGGGVSGDR SVFADLVGEF ERLSVDLEFA
QQSYVAALAT FDAARNEARR QSRYLAAHVR PTLAERAEYP QRIYLLGLIG LFSFLAWTIT
ALIAYSLRDR R