Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3641 |
Symbol | |
ID | 4075069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 698127 |
End bp | 699212 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638005161 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_611870 |
Protein GI | 99078612 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.312252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.742718 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAGAC GCGCGTTTCT CAAGACTGGT GCCATGGGCG CAGCCGCCAC AGCCTTGGCG AGCCCAGCGA TTGCCCAAGG AAAGATGCAA TGGAAGCTGG TCACAGCCTG GCCCAAGAAC CTGCCGGGGC CGGGTGTCGC CGCACAGATG CTCGCAAACC GCATCACAAC GCTGTCGGGT GGACGTATTG AGGTCAAACT CTTTGCTGCA GGCGAGCTTG TGCCGGGACG CGGCGTCTTT GATGCGGTCT CCGAAGGCAC GGCAGAGCTC TATCATGCGG TTCCGGCCTA CTGGGGGTCC AAATCCAAGG GCATTTTGCT TTTTGGCTCG CAGCCCTTTG GCCTGCGCGC AGACGAGCAG TTTGGCTGGC TCTACCATGG TGGCGGTCAG GCGCTCTATG ACGAGATGTA TGGCCGCTTT GGCATCAAGC CTTTCCTCTG CGGTAACTCC GGCCCGCAAT GGGGCGGCTG GTTCAAAACC GAGATCAATT CCGCCGAAGA CCTGAAGGGG TTGAAGTTCC GCACCACGGG CCTTGCATCC GAAATGGCGT CGAAACTTGG CATGGCAGCC GAAGCCACAA GCGGACCGGC CATGTTCCAA GGCCTGCAAA CGGGTGCATT GGACGCAGGC GAGTTCATCG GCCCCTGGAC CGACAGCGCG CTTGGCTATT ACCAGGTCGC CAAGAACTAC TACTGGCCCG GCGTGGGTGA ACCCTCTTCT GCCGAGGAAT GCGGGGTAAA CGCTGACGTC TTTGCCGAAC TGCCGGATGA TCTCAAACAG GTTGTCTCGC TGGCCTGCGA AAGCCTCTAC AATCCGGTCT GGACGGAATA CACCACCAAG CACGCACTGG CGCTGAAGAA AATGGTCGAA GAGGACGGCG TTCAGGTCAA GATGTTCCCG TCCGACGTGA TCGAGGCCAT GGGCACCGCA GCAGCAGAAG TCATCTCTGA GCTGCGCGAA GACGAAGACG AACTTGTGCG CCGTATCACC GAGAGCTTTA TCAGCTATCG CGACAGTGTG GGCGGCTACA TGACCTATGC CGACAACGGC CAGATGAACG CCCGCGCCTC GGTTATGGGC TACTGA
|
Protein sequence | MQRRAFLKTG AMGAAATALA SPAIAQGKMQ WKLVTAWPKN LPGPGVAAQM LANRITTLSG GRIEVKLFAA GELVPGRGVF DAVSEGTAEL YHAVPAYWGS KSKGILLFGS QPFGLRADEQ FGWLYHGGGQ ALYDEMYGRF GIKPFLCGNS GPQWGGWFKT EINSAEDLKG LKFRTTGLAS EMASKLGMAA EATSGPAMFQ GLQTGALDAG EFIGPWTDSA LGYYQVAKNY YWPGVGEPSS AEECGVNADV FAELPDDLKQ VVSLACESLY NPVWTEYTTK HALALKKMVE EDGVQVKMFP SDVIEAMGTA AAEVISELRE DEDELVRRIT ESFISYRDSV GGYMTYADNG QMNARASVMG Y
|
| |