Gene Information |
Locus tag | TRQ2_0475 |
Symbol | |
ID | 6091885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 464401 |
End bp | 466245 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642487657 |
Product | 4-phytase |
Protein accession | YP_001738514 |
Protein GI | 170288276 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAC TGTTTGTGCT GTTTCTGGCA GTCCTATCAG TTCTGGTACT GGCCGAAGTG AAAAACCCGG ACACCATAAT CGACGCCACC ATTGGAGAAC CCGACACTCT CGACCCACAC TACGCCTACG ACACAGCAAG TGGTGAAGTT ATCTACAACG TGTACGAGAA CCTGATCGCC TACAAAGGAG AGAGCCTCAA AGAATTCGAA CCACGCCTTG CAGAAAGATG GGAAATTCTG GACGACGGGA AAACTTACAA GTTCTACATC AGAAAAGGTG TGAAGTTCCA CGAAGGAGGA GATCTCACAC CAGAAGACGT GGAATACAGC TTTGAGAGAG GCCTCATCTT CGATCCAACA GCGGGTCCTA TGTGGATGCT CTGGGAAGCC CTGTTCGGTG TGGATTCACT GGAAACTTTC GTTGAGGAAA AGATCGGTAA GCCTTACAGC GAACTCTTCG ACGAAAACGG TGAACCGCTT CCAGAGTACA GAGACGCTCT CATAAAGATC TACACGGATT ACATCGATCC CACCATTGAA GTTGAAGGTG ACGCCGTTGT GTTCCACCTC GTGAGACCCT TCGCACCGTT CATGTACATA CTCGCCCAGA GCGCCAGCTG GAGTGCTGTC CTCGACAAAG AGTGGTGTAT AGAGATAGGA TGCTGGGACG GAAGAGCCGA TACCTGGTGG AAGTATCACG ATATCAGAAA AGAAGATTCT CCTCTCTACG CGAGAATGAA CGGAACTGGA CCCTTCAAAT TCGTCGAATG GGACAGAGCT CAGCAGAAAG TCATCCTCGA GCGAAACGAC AACTACTGGA GAGAACCCGC GAAGATCAAA AGAGTTATCA TCTGGGGAAT CGACGAGTGG AGCACAAGAA GGGCGATGTT CCTTCAGGGA GACGCCGATA TCTGTGCTGT CCCAACCCAG TACCTCGAGC AGGTGGAAGG AAAACCCGGT GTCACCGTTA TAAAGGGACT TCCTGAACTT GCAATAACAT CCCTTCACTT CGCGTGGAGC GTTCCCGAAG ACAGCAAGTA CATAGGCTCT GGAAAACTCG ACGGAAACGG AATACCACCC GATTTCTTCA CTGATGAAAA CGTGAGAAAA GCCTTCATCT ACGCGTTCGA CTACGACACA TTCATAAACG AAGTGCTCAA AGGTCTTGGT AGAAAGATAC CAACAGACCT TCCAGAAGGA CTCCTCGGAT TCAACGAAGA GCTGCTGAAC GATCCAGACG CTCCACACTT CGATATTGTG AAAGCAACGG AGTACTTCCA GAAAGCGTGG AACGGTGAGG TCTGGAAGAA AGGATTCAAG ATCACGTTGC TTTACAACAC CGGTAACGAT GTGAGAAGAG CAGCCGCAGA AATGCTGAAG GCATACATCG AGATGATCAA TCCGAAGTTC AAGGTCGAAG TGAGAGGCGT TCAGTGGCCT ACGTATCTCG ACGCAACCAA GAGAGGAGAA GTGCCTGTCT TCATCATAGG ATGGCTCGCA GATTATCCGG ATCCTCACAA CTTCATCTTC ACATACTACC ATAGTGCAGG AGTTTACTCT GGAAGACAGG GTGAGAACTT CAGGAAGTTC ATTTCCACAC CACATCCCGA CCTTGGTGGT AGAAGCCTCG ACGAGCTCAT AGAAGAAGCG ATCGCGAAGA CCGATCCCGC AGAAAGACAG GCACTCTACG AAGAGATCCA GAGGTTCGCA ATGAAGCACG CCCTTGGTAT GCCTCTCTAC CAGCCGCTCG GTGTGAGAGT CCAGAGAAGC TGGGTCAAAG GATGGTACTA CAATCCAATG 
AGACCTGGTG ACGACTACTA CGTGCTCTGG AAAGCAGAAG AGTAA
|
Protein sequence | MKKLFVLFLA VLSVLVLAEV KNPDTIIDAT IGEPDTLDPH YAYDTASGEV IYNVYENLIA YKGESLKEFE PRLAERWEIL DDGKTYKFYI RKGVKFHEGG DLTPEDVEYS FERGLIFDPT AGPMWMLWEA LFGVDSLETF VEEKIGKPYS ELFDENGEPL PEYRDALIKI YTDYIDPTIE VEGDAVVFHL VRPFAPFMYI LAQSASWSAV LDKEWCIEIG CWDGRADTWW KYHDIRKEDS PLYARMNGTG PFKFVEWDRA QQKVILERND NYWREPAKIK RVIIWGIDEW STRRAMFLQG DADICAVPTQ YLEQVEGKPG VTVIKGLPEL AITSLHFAWS VPEDSKYIGS GKLDGNGIPP DFFTDENVRK AFIYAFDYDT FINEVLKGLG RKIPTDLPEG LLGFNEELLN DPDAPHFDIV KATEYFQKAW NGEVWKKGFK ITLLYNTGND VRRAAAEMLK AYIEMINPKF KVEVRGVQWP TYLDATKRGE VPVFIIGWLA DYPDPHNFIF TYYHSAGVYS GRQGENFRKF ISTPHPDLGG RSLDELIEEA IAKTDPAERQ ALYEEIQRFA MKHALGMPLY QPLGVRVQRS WVKGWYYNPM RPGDDYYVLW KAEE
|
| |
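The Start bp, End bp, Gene Length and Protein Length values in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source record. It assumes the coordinates are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence above ends in TAA). The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 464401
END_BP = 466245


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1845 614
```

Under these assumptions the computed values match the 1845 bp and 614 aa listed in the record.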