Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1991 |
Symbol | |
ID | 4077175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2096154 |
End bp | 2097575 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638007306 |
Product | Type I secretion outer membrane protein, TolC |
Protein accession | YP_613985 |
Protein GI | 99081831 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | [TIGR01844] type I secretion outer membrane protein, TolC family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.533391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAAC TATTGAAATC AAAGCGTATT AAGGCTTTTG CGACAATCGG CGTCCTCGCC ACGGCCGTCT TGAGCGCCCC GCAGGCGCGT GCGGATAATG TCACCGACGC GATGATCGGC GCCTATACAA GCAGCGGCCT TCTGGAACAG AACCGAGCCC TGCTGCGTGC CGCAGACGAA GGGGTTGCGG CCACGGTTGC GCGGCTGCGT CCGGTGATTA CAGCCACCCT CGCGATTGCA CGCGATTACA ACCGCACGGG CTTTGGGACT GGGGGCGCGC GCTCCAGCCA CAGCACGGGT GCCTCCTTGC GCCTCGGCAT GGAGTGGTTG CTCTTTGACA ATGGTGCCAC GCAGCTCGAT CAGGCAGCGG CGCAGGAAAC CGTCCTTGCA ACCCGGCAGA CCCTTCTGGA CGTGGAGCAG CAGGTGCTGT TCGCGGCTGT GCAGGCTTAT GTCAACGTCC TGACTCAGCA GGATATCGTT GCGCTGCGTC AGAACAACCT GCGCTTGCTG CAGGAAGAGC TGCGCGCTGC CAACGACCGT TTCGAAGTGG GCGAAGTGAC GCGCACCGAT GTCGCTCTGG CCGAAAGCCG GGTGGCTGAG GCACGTGCCA ACCTGACCGA TGCACGCGGT GCCTTGCTGA CCGCACGCGC CACCTATGAA GAGGTGGTCG GACGCGCGCC GGGGGCGGTG GCGTCCTATC CGCCGTTGCC GCGCCGCGCG GCATCGCTTT CGGATGCACA GGCGCTGGCG TTGCGCAGCC ATCCGAGCCT GCGTGCGCAG CAGCATTCGG TGCGGGCGGC GCAGCTTACC GCCGACAGCT CTTTGCGCGA CATGGGCCCC AATGTGAAAT TCACCGCCAG TGCGACTCAT AGCGAGGTGC ACAGCAGCGA CTCCAATGCT GACAACTTTG ATCTCGGCCT TTCGCTCAAC CAGACCCTTT ATGCGGGTGG CGCTCTGGCT GCCGCGCGCC GGGCGGATCT GGCACGCCTC GATGCAGAGC GCGGCGCTTT GATTTCGATT CAGCGCAGCA TCTCCAGCAG CGCAGCAAGT GCCTATACCG CGTTTGAAAC CGCCGCCGCG AGCCTTGTGT CCTCGAACCA GCGCGTGCGC GCAGCGCAGG TCGCTTTTGA CGGGATCCGC GAGGAAGCCA CGCTGGGATC GCGCACCACG CTTGATGTGC TTCAGGCCGA GCAGGAGCTT CTGGATGCAC AAACGGCCCG TTTGTCGGCG CGTGCCAATC AGGCGCTGGC GGCTTATCAG CTGTTGCAGG CGCAGGGTTT GTTGACGGCT GAGAACCTCA ATCTCGCGGT CGAGCGATAT GACCCCGAAC TCTATCACAA CCAGGTGCGT TCGGCTCCGG CCTTTGTCAC CAAGCGTGGG CAGGATCTGG ACCGCGTGCT CAAGGCTCTG AGAAAAAACT GA
|
Protein sequence | MQKLLKSKRI KAFATIGVLA TAVLSAPQAR ADNVTDAMIG AYTSSGLLEQ NRALLRAADE GVAATVARLR PVITATLAIA RDYNRTGFGT GGARSSHSTG ASLRLGMEWL LFDNGATQLD QAAAQETVLA TRQTLLDVEQ QVLFAAVQAY VNVLTQQDIV ALRQNNLRLL QEELRAANDR FEVGEVTRTD VALAESRVAE ARANLTDARG ALLTARATYE EVVGRAPGAV ASYPPLPRRA ASLSDAQALA LRSHPSLRAQ QHSVRAAQLT ADSSLRDMGP NVKFTASATH SEVHSSDSNA DNFDLGLSLN QTLYAGGALA AARRADLARL DAERGALISI QRSISSSAAS AYTAFETAAA SLVSSNQRVR AAQVAFDGIR EEATLGSRTT LDVLQAEQEL LDAQTARLSA RANQALAAYQ LLQAQGLLTA ENLNLAVERY DPELYHNQVR SAPAFVTKRG QDLDRVLKAL RKN
|
| |