Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1511 |
Symbol | |
ID | 4077067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1617419 |
End bp | 1618765 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006824 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_613506 |
Protein GI | 99081352 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.565949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.673528 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAAGT CAGACGTAAG CCGTCGCGGC CTGCTGAGAA CCGGTGCGGT CGCGGGTGCA GGGCTCGCCA TGCCGACCAT CTTTACGGCC CAAAGCGCGC ATGCATTCAC CAACAACCCC ACTGGCGGCA CTGTTACACT CGGCTTTAAC GTCCCGCAGA CCGGCCCTTA CGCCGATGAA GGTGCGGACG AGTTGCGCGC CTATGAGCTG GCGGTCGAGC ACCTGAACGG CGGTGGCGAT GGCGGCATGC TCACCACCTT CAGCTCCAAG GCTCTGCAGG GCAATGGCAT CCTGGGCAAG AAAGTCGAAT ATGTCACCGG CGATACCCAG ACCAAATCCG ATGCGGCGCG CGCTTCTGCC AAGTCCATGA TCGAAAAAGA CGGTGCGATC ATGATCACGG GCGGCTCGTC TTCGGGTGTG GCTGTGGCCG TGCAGGCGCT CTGCCAAGAG GCAGGCGTAA TCTTTATGGC GGGTCTTACC CACTCCAATG ACACCACAGG CAAAGACAAG CGGGCCAATG GTTTCCGCCA CTTCTTCAAC TCTTACATGT CTGGTGCGGC GCTGGCGCCG GTGCTGGCGA ATGCCTACGG CACCGACCGT AAGGCCTATC ACCTGACCGC CGACTACAAC TGGGGCTATA CCACCGAAGA AGCAGTCCGG TCCTCCACCG AAGCGATGGG CTGGGAAACC GTGGCTGCGG TGAAAACACC GCTAACCCAG ACCGACTTCT CGTCCTATAT CGCCCCGGTT CTGCAGTCCG GTGCCGACAC GCTGGTTCTG AACCACTACG GCGGCAACAT GGTGAACTCT CTCACCAACG CGGTGCAGTT CGGCCTGCGC GACAAGCAGG TGAACGGCAA GGACTTCCAG ATCGTTGTTC CGCTCTACTC CCGCCTGATG GCGAAAGGTG CGGGCGCCAA CGTGAAGGGC ATCTTCGGCT CCACCAACTG GCACTGGTCG CTGCAGGACG AAGGTTCCAA GGCCTTTGTA CGCTCCTTCG GCACCAAATA CGGCTTCCCG CCGAGCCAGG CCGCTCACAC CTGCTATGTG CAGACCCTGC TCTATGCAGA CGCGGTTGAA CGCGCTGGCT CCTTTGCGCC CTGCGCCGTG GCAGAAGCGC TCGAGGACTA TGAGTTCGAC GGTCTGGGCA ACGGCAAGAC GCTCTATCGT GGCGCCGATC ACCAGTGCTT CAAGGACGTG CTGGTTGTGA AAGGGAAAGA GAACCCGACC TCGGAGTTCG ACCTTCTCGA AATCGTCGAA GTCACCCCGG TTGGCCAGGT CACCTATGAC CCGAACCACC CGCAGTTCCA GGGCGGTGCG CTCGGCACCT GCAACAACGG CGCCTAA
|
Protein sequence | MSKSDVSRRG LLRTGAVAGA GLAMPTIFTA QSAHAFTNNP TGGTVTLGFN VPQTGPYADE GADELRAYEL AVEHLNGGGD GGMLTTFSSK ALQGNGILGK KVEYVTGDTQ TKSDAARASA KSMIEKDGAI MITGGSSSGV AVAVQALCQE AGVIFMAGLT HSNDTTGKDK RANGFRHFFN SYMSGAALAP VLANAYGTDR KAYHLTADYN WGYTTEEAVR SSTEAMGWET VAAVKTPLTQ TDFSSYIAPV LQSGADTLVL NHYGGNMVNS LTNAVQFGLR DKQVNGKDFQ IVVPLYSRLM AKGAGANVKG IFGSTNWHWS LQDEGSKAFV RSFGTKYGFP PSQAAHTCYV QTLLYADAVE RAGSFAPCAV AEALEDYEFD GLGNGKTLYR GADHQCFKDV LVVKGKENPT SEFDLLEIVE VTPVGQVTYD PNHPQFQGGA LGTCNNGA
|
| |