Gene TM1040_1991 details


Gene Information

Locus tag: TM1040_1991
Symbol:
ID: 4077175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2096154
End bp: 2097575
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 63%
IMG OID: 638007306
Product: Type I secretion outer membrane protein, TolC
Protein accession: YP_613985
Protein GI: 99081831
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1538] Outer membrane protein
TIGRFAM ID: [TIGR01844] type I secretion outer membrane protein, TolC family
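
The start, end, gene length, and protein length fields above are consistent with one another, which a little arithmetic confirms. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2096154, 2097575)
print(length)                  # 1422
print(protein_length(length))  # 473
```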


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.533391
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAC TATTGAAATC AAAGCGTATT AAGGCTTTTG CGACAATCGG CGTCCTCGCC 
ACGGCCGTCT TGAGCGCCCC GCAGGCGCGT GCGGATAATG TCACCGACGC GATGATCGGC
GCCTATACAA GCAGCGGCCT TCTGGAACAG AACCGAGCCC TGCTGCGTGC CGCAGACGAA
GGGGTTGCGG CCACGGTTGC GCGGCTGCGT CCGGTGATTA CAGCCACCCT CGCGATTGCA
CGCGATTACA ACCGCACGGG CTTTGGGACT GGGGGCGCGC GCTCCAGCCA CAGCACGGGT
GCCTCCTTGC GCCTCGGCAT GGAGTGGTTG CTCTTTGACA ATGGTGCCAC GCAGCTCGAT
CAGGCAGCGG CGCAGGAAAC CGTCCTTGCA ACCCGGCAGA CCCTTCTGGA CGTGGAGCAG
CAGGTGCTGT TCGCGGCTGT GCAGGCTTAT GTCAACGTCC TGACTCAGCA GGATATCGTT
GCGCTGCGTC AGAACAACCT GCGCTTGCTG CAGGAAGAGC TGCGCGCTGC CAACGACCGT
TTCGAAGTGG GCGAAGTGAC GCGCACCGAT GTCGCTCTGG CCGAAAGCCG GGTGGCTGAG
GCACGTGCCA ACCTGACCGA TGCACGCGGT GCCTTGCTGA CCGCACGCGC CACCTATGAA
GAGGTGGTCG GACGCGCGCC GGGGGCGGTG GCGTCCTATC CGCCGTTGCC GCGCCGCGCG
GCATCGCTTT CGGATGCACA GGCGCTGGCG TTGCGCAGCC ATCCGAGCCT GCGTGCGCAG
CAGCATTCGG TGCGGGCGGC GCAGCTTACC GCCGACAGCT CTTTGCGCGA CATGGGCCCC
AATGTGAAAT TCACCGCCAG TGCGACTCAT AGCGAGGTGC ACAGCAGCGA CTCCAATGCT
GACAACTTTG ATCTCGGCCT TTCGCTCAAC CAGACCCTTT ATGCGGGTGG CGCTCTGGCT
GCCGCGCGCC GGGCGGATCT GGCACGCCTC GATGCAGAGC GCGGCGCTTT GATTTCGATT
CAGCGCAGCA TCTCCAGCAG CGCAGCAAGT GCCTATACCG CGTTTGAAAC CGCCGCCGCG
AGCCTTGTGT CCTCGAACCA GCGCGTGCGC GCAGCGCAGG TCGCTTTTGA CGGGATCCGC
GAGGAAGCCA CGCTGGGATC GCGCACCACG CTTGATGTGC TTCAGGCCGA GCAGGAGCTT
CTGGATGCAC AAACGGCCCG TTTGTCGGCG CGTGCCAATC AGGCGCTGGC GGCTTATCAG
CTGTTGCAGG CGCAGGGTTT GTTGACGGCT GAGAACCTCA ATCTCGCGGT CGAGCGATAT
GACCCCGAAC TCTATCACAA CCAGGTGCGT TCGGCTCCGG CCTTTGTCAC CAAGCGTGGG
CAGGATCTGG ACCGCGTGCT CAAGGCTCTG AGAAAAAACT GA
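
The gene sequence is listed in blocks of ten bases, so the whitespace has to be removed before any calculation. As a minimal sketch, the hypothetical helper below computes a GC percentage from such a blocked listing. Passing it the full gene sequence gives a value to compare with the GC content reported in the gene information.

```python
def gc_percent(blocked_sequence: str) -> float:
    """GC percentage of a nucleotide sequence given in whitespace-separated blocks."""
    # Join the 10-base blocks and lines into one contiguous string.
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Toy input only; substitute the full gene sequence listed above.
print(gc_percent("GGCC AATT"))  # 50.0
```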
 
Protein sequence
MQKLLKSKRI KAFATIGVLA TAVLSAPQAR ADNVTDAMIG AYTSSGLLEQ NRALLRAADE 
GVAATVARLR PVITATLAIA RDYNRTGFGT GGARSSHSTG ASLRLGMEWL LFDNGATQLD
QAAAQETVLA TRQTLLDVEQ QVLFAAVQAY VNVLTQQDIV ALRQNNLRLL QEELRAANDR
FEVGEVTRTD VALAESRVAE ARANLTDARG ALLTARATYE EVVGRAPGAV ASYPPLPRRA
ASLSDAQALA LRSHPSLRAQ QHSVRAAQLT ADSSLRDMGP NVKFTASATH SEVHSSDSNA
DNFDLGLSLN QTLYAGGALA AARRADLARL DAERGALISI QRSISSSAAS AYTAFETAAA
SLVSSNQRVR AAQVAFDGIR EEATLGSRTT LDVLQAEQEL LDAQTARLSA RANQALAAYQ
LLQAQGLLTA ENLNLAVERY DPELYHNQVR SAPAFVTKRG QDLDRVLKAL RKN
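
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that, under stated assumptions: it uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may initiate translation), and it does not special-case alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order: TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_sequence: str) -> str:
    """Translate a blocked coding sequence, stopping at the first stop codon."""
    seq = "".join(blocked_sequence.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCAAAAAC TATTGAAATC AAAGCGTATT"))  # MQKLLKSKRI
```

Translating the full gene sequence the same way and comparing the result with the protein listing, after removing the spaces between blocks, checks the whole record.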