Gene Information |
Locus tag | TM1040_3049 |
Symbol | |
ID | 4075143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 17544 |
End bp | 19208 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638004550 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_611285 |
Protein GI | 99078027 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
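The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced CDS whose length is a multiple of three.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp 17544, End bp 19208.
length = gene_length_bp(17544, 19208)
print(length)                     # 1665, matching "Gene Length"
print(protein_length_aa(length))  # 554, matching "Protein Length"
```

The strand (here "-") does not affect either length; it only determines which strand of the replicon is read.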
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCATT TCACCAGAAC AGGACAACCG TTGCCGGCCT CGCTGACACG TCAGGCACCC GAGATGGCTG CTGATCCCGT CAGCCGCCGA GAGTTCCTGG CGACGGCCTG CTCTTTTGGC GCCACCGCCG CCACGGCCTA TGCGATGATG GGTCTGAACG CCCCGGCACA GGCTGCGGCC AATGCCCAGA TGGGGGGCAC CGTGCGCATC CAGCAGCAAC TTGTGGCCAT GCGTGACCCT CGAAAGTTTG ACTTCAATTC GCTTGCCACC TTTACGCGCG GATGGCTGGA ATACCTTGTC CAATACAACT CTGATGGCAG CTTCACCCCG ATCCTGCTGG ACAGCTGGGA AATCAGCGAG GACGCCACCG AATATTCGCT CAATGTGCGC AAAGGTGTCA CCTGGAACAA CGGCGAGCCT TTCACTGCCG ATGATGTGGC CAACAATATT GCCCGGTTCT GTGATGGCAC CGTCGAAGGC AACGCCATGG CGGGCAAAAT GCAGGTGATG ATTGATCCCG AAACCAATCA AATACTGCCG GGTGTGGTCG AAGTCACTGA CAGCCACTCG CTGAAGCTAA AGCTGCCGAA ACCGGACATT TCGCTGATCG CGAGCTTTGC CGACTATCCC GCTGCCGTTG TGCATCCGTC GTTTGAGGAA TTCTCGATGC TGGATAACCC GATCGGAACC GGGCCCTACC TGCCCGAGTT CTACAGTGTC GGTGACAGTG CGGCGCTTGT GCGCAATCCC GATCACACAT GGTGGAACGC GGGCAATGGC GGCTATATGG ACCGGGTCGA GTTCATCGAT TTCGGCGCGG ACCCTGCGGC CTTTTTTGCG GGGGCGGACG CGGATGAATA CGACGTCAAT TACGACACCG AAGGCGACTA TATGGACGCC TATGACTCGC TTGATGGCTG GACCAAGCAT GAGGTGACAA CCGCCGCCAC GGTACTTGCG CGTTGCAACC AACTGGCCGA GGTGAATGGC AAAGCTGTCT ATGCCGATGC CCGCGTCCGC CGTGCGCTTG CGATGGCCGT GGACAACGAA GTGGTGCTGG AAATCGCCTA TGGCGGGCAG GGCCGTGCAG CGGAAAACCA CCATGTGTCT CCCGTGCATC CCGATTATGC CGACATCGGT GCCCCTGTGC GCGATCCTGA GGTGGCAATG CAATTGCTGG AGGATGCCGG CATGTCGGAC TTCGAGCACG AGCTGGTCTC GCTCGATGCG GCGTTCTGGA AGGCCACGGG CGACGCCATT GCCGCGCAGC TGCGTGACGC TGGAATCAAG GTCAAACGCA CGGTGTTTCC GACCTCTACC TTCTGGAACA ACTGGGCGAA ATACCCGTTC TCGGTGACGA ACTGGAACCA CCGCCCCCTC GGCATTCAGA CACTCGCCTT GGCCTATCGC AGTGGTCAGG GATGGAACGA GTCCGGTTTT GCCAATGCGG AGTTTGACGC GCTGGTCGAA GAGGCGCTGG CAACTGCCGA TCCAGAGGCG CGCAGCAAAA TCATGGCCAA GCTTGAACAG ATCATGATCG ATGAAGGTGT CATCATCCAA CCCTATTGGC GTTCGCTCTA CAACCATTCC AAGAGCAATC TGAAGGGCGC TGAAATCCAC ATCTCCAACG AGCTTTATCC GCAGTACATG TATTGGGAAG CTTGA
|
Protein sequence | MTHFTRTGQP LPASLTRQAP EMAADPVSRR EFLATACSFG ATAATAYAMM GLNAPAQAAA NAQMGGTVRI QQQLVAMRDP RKFDFNSLAT FTRGWLEYLV QYNSDGSFTP ILLDSWEISE DATEYSLNVR KGVTWNNGEP FTADDVANNI ARFCDGTVEG NAMAGKMQVM IDPETNQILP GVVEVTDSHS LKLKLPKPDI SLIASFADYP AAVVHPSFEE FSMLDNPIGT GPYLPEFYSV GDSAALVRNP DHTWWNAGNG GYMDRVEFID FGADPAAFFA GADADEYDVN YDTEGDYMDA YDSLDGWTKH EVTTAATVLA RCNQLAEVNG KAVYADARVR RALAMAVDNE VVLEIAYGGQ GRAAENHHVS PVHPDYADIG APVRDPEVAM QLLEDAGMSD FEHELVSLDA AFWKATGDAI AAQLRDAGIK VKRTVFPTST FWNNWAKYPF SVTNWNHRPL GIQTLALAYR SGQGWNESGF ANAEFDALVE EALATADPEA RSKIMAKLEQ IMIDEGVIIQ PYWRSLYNHS KSNLKGAEIH ISNELYPQYM YWEA
|
| |
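The Gene sequence field is printed in space-separated 10-base blocks, so it needs to be joined before any calculation. The following is a minimal sketch of how the blocks could be cleaned and a GC fraction computed for comparison with the GC content field; the helper names are hypothetical, and the example uses only the first two blocks of the sequence rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the record's Gene sequence field.
first_blocks = "ATGACTCATT TCACCAGAAC"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.4 for these 20 bases only
```

Running the same two functions on the full field gives a value to compare against the reported GC content; the short excerpt above is not expected to match the whole-gene figure.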