Gene TM1040_2133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2133 
Symbol 
ID4076447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2238343 
End bp2239470 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content62% 
IMG OID638007453 
Productpolysaccharide export protein 
Protein accessionYP_614127 
Protein GI99081973 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTAC GGTGGGCGCG CCCCGTGGCC TTGTTGGCTG CGGTTGCCCT CGCAGCATCA 
TGCGGGCTGC CCAAGGTGGG CCCGAACAAA CGTGAGATCT TTGCAGGGTC GGTACAAAAA
CAGGGCGATG CCTTTGTGGT CTCGGTCAAC GACCGCGTTG CCCGGGCAAC AGCCGTGGTA
CCCGCACTTG GGTTTTCCGA CGCCTTCACC AAAGCATCGG TTCTTACCTC CGACATCATT
CGCCCCGGCG ATATCCTTGG CCTGACGATC TGGGAAAACG TCGACGACGG GCTGCTTGCC
AGCGCCGGCG CCAATGCCAC CCTTCTCGAA GAAGTGCAGG TTGATGGTGC GGGCTTCATC
TTTGTGCCCT ACGCAGGCCG CGTGCGCGCC TCGGGCAATA CGCCAGAGCA GTTGCGCGAA
GCCATCACCA AGAAGCTCGA AGACCAGACG CCCGACCCGC AGGTTCAGGT GCGCCGCCTT
GCCGGCGATG GCGCCACAGT CAGCCTCACC GGAGCGGTGG GCGCGCAGGG GGTCTATCCA
ATCGAACGTC CGACGCGCAC TCTGGCCACC ATGCTGGCGC AAGCTGGCGG CGTGGCGATC
GAACCCGAGA TTGCGCAGGT CTCTGTGACC CGCCAAGGGC AGACTGGCAC GATCTGGTTC
GAGGACCTCT ACGACCACCC CCAGATGGAC ATCGCGCTGC GCAATGGCGA CAAGATCCTT
GTGGAAGGCG ATACGCGCTC CTTTACCGCG CTGGGAGCGA CCGCGGCGCA GGCCCGTGTA
CCTTTCGAGA GCCAGAACCT CTCGGCGCTT GAAGCTCTTG CACAGGTCGG CGGCCTGATC
GCCACGGCAT CCGATCCCAC CGGTGTCTTT GTCTTCCGCA ATGAACCTGA AGCGATCTCA
AATCAGGTGC TTGGGCGTGA CGATCTGATC GGCGCGCAGC GCATGATCTA CGTGCTGAAC
CTCACTCAGC CCAACGGTCT CTTCATTGCC CGCGACTTCG TGATCCGCGA TGGCGACACC
ATCTATGTGA CCGAAGCACC CTATGCCCAA TGGACCAAGA CGCTGTCTCT TCTGACCAGC
CCGCTGTCCA CGGCTGCAAG TGTCGAGACC CTGTTCGGCG GCAGTTAA
 
Protein sequence
MTLRWARPVA LLAAVALAAS CGLPKVGPNK REIFAGSVQK QGDAFVVSVN DRVARATAVV 
PALGFSDAFT KASVLTSDII RPGDILGLTI WENVDDGLLA SAGANATLLE EVQVDGAGFI
FVPYAGRVRA SGNTPEQLRE AITKKLEDQT PDPQVQVRRL AGDGATVSLT GAVGAQGVYP
IERPTRTLAT MLAQAGGVAI EPEIAQVSVT RQGQTGTIWF EDLYDHPQMD IALRNGDKIL
VEGDTRSFTA LGATAAQARV PFESQNLSAL EALAQVGGLI ATASDPTGVF VFRNEPEAIS
NQVLGRDDLI GAQRMIYVLN LTQPNGLFIA RDFVIRDGDT IYVTEAPYAQ WTKTLSLLTS
PLSTAASVET LFGGS