Gene TM1040_0654 details


Gene Information

Locus tag: TM1040_0654
Symbol:
ID: 4078167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 700651
End bp: 702408
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 61%
IMG OID: 638005951
Product: capsule polysaccharide export protein-like
Protein accession: YP_612649
Protein GI: 99080495
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3524] Capsule polysaccharide export protein
TIGRFAM ID: [TIGR01010] polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family
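
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, not part of the original record; the function name `cds_lengths` is hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Start bp and End bp copied from the record.
print(cds_lengths(700651, 702408))
```

Under those assumptions the result matches the Gene Length and Protein Length fields of the record (1758 bp and 585 aa).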


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTACGA AACCCAAGGC TAAAAAATTC CGTATCCGCC GCCCGAGCTC GGGAGCTGAA 
CAGCCGCAGG CCGCTGCGGC GGCGACCGCT CGCCCGGTGC CTCCGCCGCC CGCACAGGAG
ACCGGACGCG ACCAGCCGCT TGAGATGCAG TCTCAGGTGG AACAGGCTGC AAAGGCTTCT
TCCGCGGGCG ATTTTGCGCC TGCCGCGTCC GACACACCAG CGCGCGCGTC CTCGCCACAG
ATGACGCCAT CCCAGAGCGG CGCCACAACG CCACCGCAGG ATGCATCCGA GGATGTGGGC
TCGACCGATA TTGAAGACAT CAAGCGTGAG GGCCTCACGG GGCGTCAATT GCGCCTCGCC
CGTCGCGTTG CACAAAAGCA CGATCTGCCT GCGACATCGG ATTACGACGC TGTGCGTTTG
CTGCGGCTGC GCGGCATTGA TCCATTCAAA CGCGCCAACA TGCTAGAGCT GGTGGTGCCG
CAAAGCCAAA ACGCAAGCGT GCCCGCGACC CAAGCCCCAC AGGGCGCGCC CAAGCCACAG
ACGCTTCCGC AGACGGTAGA GAAAAGCAAA CCCAGCGCAC CGCCCGCCGA TCATCTGAGC
CCGACCGAGC GACGCAACCG CGAAATTCGC TCCATCCAGC GTGACATCGC GCGACGCCGC
CGCCGTAAAA TGGCGCTCTT GGGGGCGCGG CTCGGGGCCT TTGTTCTGAT CCCTACGCTT
CTGGCAGGGT ACTATTATTA CAAGGTCGCA ACACCGATGT ATGCGTCGCA TTCGGAGTTC
CTTGTTCTCA AAGCCGACAG CACCGGGTCT TCGGGGTTCG GCGGCCTTCT GAGCGGCACG
CAATTTGCCA CCAGCCAGGA TTCCATTGCG GTACAGGCCT ATCTGCAATC CAAGGTCGCG
ATGCGCCGCC TCGACGAAGA GGCCGGATTT CGCGCGCATT TCTCGCAGGA CTGGATCGAC
CCAATTCAGC GACTGGAACC AGACGCCAGC AATGACGACG CCTACAAAAC CTATCAGCGC
AATGTAAAAA TCGGCTATGA TCCCACCGAG GGCGTCATTC GCATGGATGT CTCTGCGGCA
GAACCAGCGG TTGCGGCAGA GTTTTCACGC CGCCTGATTT CCTATGCACA GGAAAACGTG
AACCACCTTT CCGAGCAAAA GCGCGCCGAT CAGGTGGGCG ACGCCGAGGA GGCGCTTGCC
CTTGCAGAGC AGCAACGCCG CGACGCCCAG GCAGAACTTG TGCGCCTGCA GCAGCAAGGG
TCGGTCCTCG ATCCCGAAGG GGTCATTGCC TCGCTGCGCT CCCAGATCAA CACGTTTGAG
CTTCAACTGC AGCAAAAGCG CCTGGAGCTC GCGGCGTTGC AGGACAACCT GCGCCCCAAC
GCCGCCAAGG TCGAAGGCAC TGCCGCAGAC ATCAAACGCC TTGAGGCGCT GATTGCAAAT
CTCAACGAAC GCATGACCGA TGCGTCTCAG GGCGAGAACT CGCTTGCCTC GCTGAGCGTC
AAGATCCAGA TGGCACAAGC GGACCTCGCG ACGCGCGACA TGATGCTGCA ATCCGCCCTG
CAACAGGTCG AACAAACCCG TATGGAGGCA AACCGCCAGG TGCGCTATCT GACAACTGCG
GTCGAGCCGG TTCCCGCCGA CACACCCTCC TCGCCGCGCA AGTTCGAAAA TACGATTTTG
GCTTTCCTGA TCTTTTCCGG TATCTACCTG ATGTGTGCCC TCACGGCATC CATTCTTCGG
GAACAGGTCT CTTCGTAA
 
Protein sequence
MTTKPKAKKF RIRRPSSGAE QPQAAAAATA RPVPPPPAQE TGRDQPLEMQ SQVEQAAKAS 
SAGDFAPAAS DTPARASSPQ MTPSQSGATT PPQDASEDVG STDIEDIKRE GLTGRQLRLA
RRVAQKHDLP ATSDYDAVRL LRLRGIDPFK RANMLELVVP QSQNASVPAT QAPQGAPKPQ
TLPQTVEKSK PSAPPADHLS PTERRNREIR SIQRDIARRR RRKMALLGAR LGAFVLIPTL
LAGYYYYKVA TPMYASHSEF LVLKADSTGS SGFGGLLSGT QFATSQDSIA VQAYLQSKVA
MRRLDEEAGF RAHFSQDWID PIQRLEPDAS NDDAYKTYQR NVKIGYDPTE GVIRMDVSAA
EPAVAAEFSR RLISYAQENV NHLSEQKRAD QVGDAEEALA LAEQQRRDAQ AELVRLQQQG
SVLDPEGVIA SLRSQINTFE LQLQQKRLEL AALQDNLRPN AAKVEGTAAD IKRLEALIAN
LNERMTDASQ GENSLASLSV KIQMAQADLA TRDMMLQSAL QQVEQTRMEA NRQVRYLTTA
VEPVPADTPS SPRKFENTIL AFLIFSGIYL MCALTASILR EQVSS
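
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below is illustrative only and not part of the original record: the helper names `clean`, `gc_fraction` and `ends_with_stop` are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). It shows how the GC content field and the reading frame could be checked once the full gene sequence is pasted in.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of three and the last codon is a stop."""
    seq = clean(seq)
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Short fragments copied from the record, for illustration only.
first_block = "ATGACTACGA"
last_line = "GAACAGGTCT CTTCGTAA"

print(gc_fraction(first_block))
print(ends_with_stop(last_line))
```

Applying `gc_fraction` to the complete gene sequence, rather than to the fragment used here, is what would be compared against the GC content field.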