Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0878 |
Symbol | |
ID | 4076248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 936737 |
End bp | 937822 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638006180 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_612873 |
Protein GI | 99080719 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.228137 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCGCA GATCCTTTCT GAAAACATCC GCAATCGGTG GCTCCGCAGC CGCCGCATCC GCAGCTCTCG CCGCACCGGC CTACGCACAG GGCAATCGCA CCCTGACCAT GGTCACCACC TGGGGCCGTG GCCTTGCTGG TGTGCATGAC TCCGCGCAGC GCGTGGCAGA CAGCATCACC GCCATGACCG ACGGCGCTCT GACCGTTGAC CTCAAAGCCG CTGGCGAACT CGTTGGCGCG TTCGAGGTGT TTGACGCCGT GACCGCGGGT CAGGCCGACA TGTACCACGG TGCCGACTAT TACTTCGTGG GTCAGCACCC GGGCTATGCG TTCTTCACCG CCGTGCCTTT CGGCATGACC GCGCAGGAAC TGGCCAACTG GTACTACCAG GACGGCGGCA TGGAGCTCCA CGACGAGCTG GGTCAGATCT TTGGTCTGAA GTCCTTCCTC GGCGGCAACA CCGGCGCGCA GGCCGGTGGC TGGTTCTCCA AGGAGATCAA CAGCCCCGAC GACTTCAACG GTCTGAAGTT CCGTATGCCG GGTCTCGGCG GTAAAGCTCT GGGTAAACTG GGCGCATCCG TTCAGAACAT CCCCGGCTCC GAAGTGTATC AGGCGCTCTC CTCCGGTGCG ATCGACGGCA CCGAGTGGAT CGGCCCCTGG GCAGACGAGA AAGCGGGCTT CCAGGAAATC ACCAAGACCT ACTACACCGC TGGCTTCCAC GAGCCGGGTG CGGGTCTCTC CGTTGCGACC AACCGCGACG TGTTCGAGAG CCTCACACCG GGTCAGCAGA AGGTGATCGA GATCGCCTCT GCCGAAGCAC ACCAGTGGAA CCTGAGCCAG TTCCTCGCCA ACAACGGTGC AGCGCTGCAG CGTCTGCAGG CCGGTGGCGT GAAGGTCATG GAATTCCCCG ACTCCGTCTG GGATGCCTTC GGCAAGGCCT CCATGGAAGT GCACCAGGAA AACATGGGCG ACGATCTCTA CAAGAAGATC TACGACAGCG CCATGGCCTC CATGAAAGCC TCCTCCGGCT GGATCAACCA GTCCTCCGGC ACCTATGTGG CACAGCGCGA CCGCGTTCTG GGCTAA
|
Protein sequence | MDRRSFLKTS AIGGSAAAAS AALAAPAYAQ GNRTLTMVTT WGRGLAGVHD SAQRVADSIT AMTDGALTVD LKAAGELVGA FEVFDAVTAG QADMYHGADY YFVGQHPGYA FFTAVPFGMT AQELANWYYQ DGGMELHDEL GQIFGLKSFL GGNTGAQAGG WFSKEINSPD DFNGLKFRMP GLGGKALGKL GASVQNIPGS EVYQALSSGA IDGTEWIGPW ADEKAGFQEI TKTYYTAGFH EPGAGLSVAT NRDVFESLTP GQQKVIEIAS AEAHQWNLSQ FLANNGAALQ RLQAGGVKVM EFPDSVWDAF GKASMEVHQE NMGDDLYKKI YDSAMASMKA SSGWINQSSG TYVAQRDRVL G
|
| |