Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_4076 |
Symbol | |
ID | 5714621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009958 |
Strand | + |
Start bp | 9370 |
End bp | 10845 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641276983 |
Product | capsule polysaccharide export protein |
Protein accession | YP_001542279 |
Protein GI | 159046610 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3524] Capsule polysaccharide export protein |
TIGRFAM ID | [TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.054878 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACACAGC CCAAACCCTC CCCCCAGGAC AGGGGCCCCT CCGCCCCCGA GCCCAAGCCC GGTGCACCCG ACAGGGCCGC GCAGCAGGGC GCCGCGGCCT CGAAGGCGCC GGGGTCGGGC GGTCAGAACA GACCCGGCGG TCCGGACGGG GCAAGCCCGA AGGCGCGGCC TCAACCCGTC GTCCAGGCCT CGAAAGAGAC TGCTAAAAAG ACCGGAAAAG ACACTTTAAA GGACACGGGG CACAAGACCG CCGCCCCGCA GCCCGCCCAG AAACCGGCCC AGAAACCCAC ACCGAAACCC GCGCCTACAA AGGGCCAGGC CGAAGCGGCG CAGATCCGCA TCGCCCCACC GGTTCCCCCG GCCCGCCGCC GTCCTCATCA CAAGTTGCTG ATGATCAGCT TCGCGATCAT GGTGGTCGTC CCGATCCTGG TCTCGGCCTG GTATCTCTGG GCCCGGGCTC AGGACCAATA CGCGTCTTAC GTGGGCTTTT CCGTGCGCAC CGAGGAGGTC GGCTCGGCCA TCGAGCTTCT GGGCGGGATC ACCGAGCTGT CGGGCTCGTC GAGCTCGGAC ACCGATATTC TCTACAAGTT CCTGCAAAGC CAGGAGCTGG TGGCGACGGT GGATGCCGCG CTCGACCTGC GCGGGATCTG GTCGAAGGCC GACCCGGAGG TAGACCCGAT CTTCGCCTAT GATCCCCCCG GCACCATCGA GGATCTGCAG GATCACTGGC TGCGCAAGGT GTCGATCTAT TACGACAGCG GCTCGGGACT CCTGGACCTG CGGGTGCTGG CCTTCGATCC CGCCGACGCG CGCGCCATCG CCGAGATGAT CTTCGCCGAG AGCAGCGCGC GCATCAACGC GCTCTCGGCG CTGGCCCGGG AAGACGCCAT CGCCTATGCC CGCGACGAGC TCACCCAGGC CGAGACCCGT CTGCGCGACG CGCGCCTTGC GCTCAACGAG TTCCGCAACC GCACACAGAT CGTCGATCCG ACCATCGACA CCCAGGGCCA GATGGGGCTG GTCAACACGC TGCAGGCGCA GCTGGCCGAT GCCCTGATCG AGGTGGACCT GCTGCGCGAG ACGACGCGCA CCGGCGACCC GCGGATCACG CAAGGCGAAC TGCGCATCGC GGTGATCGAA CGCCGCATCG AGGAGGAACG CCGCAAGGTG GGCCTGGGCG GCGGGGTCTC GGGGGACCGG TCGGTCTTTG CCGACCTGGT GGGCGAGTTC GAGCGGCTCT CGGTGGACCT GGAATTCGCC CAGCAGAGCT ACGTGGCAGC GCTCGCCACC TTCGACGCGG CCCGCAACGA GGCCCGCCGC CAGAGCCGGT ACCTGGCCGC CCATGTCCGC CCGACCCTGG CCGAGCGGGC CGAGTACCCC CAACGGATCT ACCTGCTGGG CCTGATCGGG CTGTTTTCCT TCCTGGCCTG GACGATCACG GCCCTCATCG CCTATTCCCT TCGAGACCGG CGCTGA
|
Protein sequence | MTQPKPSPQD RGPSAPEPKP GAPDRAAQQG AAASKAPGSG GQNRPGGPDG ASPKARPQPV VQASKETAKK TGKDTLKDTG HKTAAPQPAQ KPAQKPTPKP APTKGQAEAA QIRIAPPVPP ARRRPHHKLL MISFAIMVVV PILVSAWYLW ARAQDQYASY VGFSVRTEEV GSAIELLGGI TELSGSSSSD TDILYKFLQS QELVATVDAA LDLRGIWSKA DPEVDPIFAY DPPGTIEDLQ DHWLRKVSIY YDSGSGLLDL RVLAFDPADA RAIAEMIFAE SSARINALSA LAREDAIAYA RDELTQAETR LRDARLALNE FRNRTQIVDP TIDTQGQMGL VNTLQAQLAD ALIEVDLLRE TTRTGDPRIT QGELRIAVIE RRIEEERRKV GLGGGVSGDR SVFADLVGEF ERLSVDLEFA QQSYVAALAT FDAARNEARR QSRYLAAHVR PTLAERAEYP QRIYLLGLIG LFSFLAWTIT ALIAYSLRDR R
|
| |