Gene Information |
Locus tag | TM1040_0654 |
Symbol | |
ID | 4078167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 700651 |
End bp | 702408 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005951 |
Product | capsule polysaccharide export protein-like |
Protein accession | YP_612649 |
Protein GI | 99080495 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3524] Capsule polysaccharide export protein |
TIGRFAM ID | [TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family |
|
|
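The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the reported gene length includes the terminal stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 700651
end_bp = 702408

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1758 585
```

Both results match the Gene Length (1758 bp) and Protein Length (585 aa) fields.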
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACGA AACCCAAGGC TAAAAAATTC CGTATCCGCC GCCCGAGCTC GGGAGCTGAA CAGCCGCAGG CCGCTGCGGC GGCGACCGCT CGCCCGGTGC CTCCGCCGCC CGCACAGGAG ACCGGACGCG ACCAGCCGCT TGAGATGCAG TCTCAGGTGG AACAGGCTGC AAAGGCTTCT TCCGCGGGCG ATTTTGCGCC TGCCGCGTCC GACACACCAG CGCGCGCGTC CTCGCCACAG ATGACGCCAT CCCAGAGCGG CGCCACAACG CCACCGCAGG ATGCATCCGA GGATGTGGGC TCGACCGATA TTGAAGACAT CAAGCGTGAG GGCCTCACGG GGCGTCAATT GCGCCTCGCC CGTCGCGTTG CACAAAAGCA CGATCTGCCT GCGACATCGG ATTACGACGC TGTGCGTTTG CTGCGGCTGC GCGGCATTGA TCCATTCAAA CGCGCCAACA TGCTAGAGCT GGTGGTGCCG CAAAGCCAAA ACGCAAGCGT GCCCGCGACC CAAGCCCCAC AGGGCGCGCC CAAGCCACAG ACGCTTCCGC AGACGGTAGA GAAAAGCAAA CCCAGCGCAC CGCCCGCCGA TCATCTGAGC CCGACCGAGC GACGCAACCG CGAAATTCGC TCCATCCAGC GTGACATCGC GCGACGCCGC CGCCGTAAAA TGGCGCTCTT GGGGGCGCGG CTCGGGGCCT TTGTTCTGAT CCCTACGCTT CTGGCAGGGT ACTATTATTA CAAGGTCGCA ACACCGATGT ATGCGTCGCA TTCGGAGTTC CTTGTTCTCA AAGCCGACAG CACCGGGTCT TCGGGGTTCG GCGGCCTTCT GAGCGGCACG CAATTTGCCA CCAGCCAGGA TTCCATTGCG GTACAGGCCT ATCTGCAATC CAAGGTCGCG ATGCGCCGCC TCGACGAAGA GGCCGGATTT CGCGCGCATT TCTCGCAGGA CTGGATCGAC CCAATTCAGC GACTGGAACC AGACGCCAGC AATGACGACG CCTACAAAAC CTATCAGCGC AATGTAAAAA TCGGCTATGA TCCCACCGAG GGCGTCATTC GCATGGATGT CTCTGCGGCA GAACCAGCGG TTGCGGCAGA GTTTTCACGC CGCCTGATTT CCTATGCACA GGAAAACGTG AACCACCTTT CCGAGCAAAA GCGCGCCGAT CAGGTGGGCG ACGCCGAGGA GGCGCTTGCC CTTGCAGAGC AGCAACGCCG CGACGCCCAG GCAGAACTTG TGCGCCTGCA GCAGCAAGGG TCGGTCCTCG ATCCCGAAGG GGTCATTGCC TCGCTGCGCT CCCAGATCAA CACGTTTGAG CTTCAACTGC AGCAAAAGCG CCTGGAGCTC GCGGCGTTGC AGGACAACCT GCGCCCCAAC GCCGCCAAGG TCGAAGGCAC TGCCGCAGAC ATCAAACGCC TTGAGGCGCT GATTGCAAAT CTCAACGAAC GCATGACCGA TGCGTCTCAG GGCGAGAACT CGCTTGCCTC GCTGAGCGTC AAGATCCAGA TGGCACAAGC GGACCTCGCG ACGCGCGACA TGATGCTGCA ATCCGCCCTG CAACAGGTCG AACAAACCCG TATGGAGGCA AACCGCCAGG TGCGCTATCT GACAACTGCG GTCGAGCCGG TTCCCGCCGA CACACCCTCC TCGCCGCGCA AGTTCGAAAA TACGATTTTG GCTTTCCTGA TCTTTTCCGG TATCTACCTG ATGTGTGCCC TCACGGCATC CATTCTTCGG GAACAGGTCT CTTCGTAA
|
Protein sequence | MTTKPKAKKF RIRRPSSGAE QPQAAAAATA RPVPPPPAQE TGRDQPLEMQ SQVEQAAKAS SAGDFAPAAS DTPARASSPQ MTPSQSGATT PPQDASEDVG STDIEDIKRE GLTGRQLRLA RRVAQKHDLP ATSDYDAVRL LRLRGIDPFK RANMLELVVP QSQNASVPAT QAPQGAPKPQ TLPQTVEKSK PSAPPADHLS PTERRNREIR SIQRDIARRR RRKMALLGAR LGAFVLIPTL LAGYYYYKVA TPMYASHSEF LVLKADSTGS SGFGGLLSGT QFATSQDSIA VQAYLQSKVA MRRLDEEAGF RAHFSQDWID PIQRLEPDAS NDDAYKTYQR NVKIGYDPTE GVIRMDVSAA EPAVAAEFSR RLISYAQENV NHLSEQKRAD QVGDAEEALA LAEQQRRDAQ AELVRLQQQG SVLDPEGVIA SLRSQINTFE LQLQQKRLEL AALQDNLRPN AAKVEGTAAD IKRLEALIAN LNERMTDASQ GENSLASLSV KIQMAQADLA TRDMMLQSAL QQVEQTRMEA NRQVRYLTTA VEPVPADTPS SPRKFENTIL AFLIFSGIYL MCALTASILR EQVSS
|
| |
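The sequence fields can be checked against the GC content and translation table fields with a small script. This is a minimal sketch, not the database's own method. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA even though the gene is on the minus strand), and it uses the amino acid assignments of NCBI translation table 11. Alternative start codons are not handled, and the function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11,
# listed for codons enumerated over T, C, A, G (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence field.
first_blocks = "ATGACTACGA AACCCAAGGC TAAAAAATTC"
print(translate(first_blocks))  # MTTKPKAKKF
```

Passing the full Gene sequence field to `translate` and `gc_percent` allows a comparison with the Protein sequence and GC content fields.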