Gene Information |
Locus tag | Tfu_2212 |
Symbol | |
ID | 3580918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermobifida fusca YX |
Kingdom | Bacteria |
Replicon accession | NC_007333 |
Strand | + |
Start bp | 2602061 |
End bp | 2604169 |
Gene Length | 2109 bp |
Protein Length | 702 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637685919 |
Product | prolyl oligopeptidase |
Protein accession | YP_290268 |
Protein GI | 72162611 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | |
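The coordinates and lengths listed above agree with one another if Start bp and End bp are read as 1-based, inclusive positions and the final codon is a stop codon that is not counted in the protein length. Both readings are assumptions; the record does not state them. A minimal Python sketch of the arithmetic (the variable names are illustrative only):

```python
# Values copied from the Gene Information fields above.
start_bp = 2602061
end_bp = 2604169

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop
# codon, which is not counted as an amino acid.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values match the listed Gene Length (2109 bp) and Protein Length (702 aa).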
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGTC TCCCTTCCTA TCCCTTTGCT GAGCGCTGGG AATTGACTGA CGTGCTCCAC GGCCAGACGG TCGCGGACCC CTACCGCTGG CTGGAGCAGA CTGACCTCGC CGCCACCAAA GAGTGGTCGC AAAGCCAGGA CGCGCTGTTC ACAGCAGCAT CCGCGATGTG GTCGGCCACC GACTGGTTCC GGACGCGGCT GCGGGCCCTG ATGAATGCCG GCTCGGTGGG GGCACCGGTT TGGCGGGGAC AGCGTCGCCT GTTCGTGCGG CGCGCCGCCG ACCAGGAGCA CGCGGTGCTC TACGTCCAGG ACGGTTCAGG ACCGGAACGG GTCCTCATCG ACCCCACCGA ACTGAACCCG GAAGGCACCA CCACCCTGGA CTCGTGGGTG GTGGACTGGG AGGGACGGCT GCTGGCCTAC CAGTTGTCCG ATAACGGAGA CGAGCAGTCC CGGCTGTGGG TGATGGACAT CGACACCGGG GAGAACATCG ACGGCCCCAT CGACCGCTGC TCCTACTCCC CGGTCGCGTG GCTGCCGGGC CGTGACGCGT TCTACTATGT GCGCCGCCTC GCCCCCGAAC TGGTCCCCGA AGGAGAAGAG CAGTACCACC GGCGGGTCTA CCTGCACCGG GTGGGCACCT CCCCGGAAGA GGACACGCTG ATCTTCGGGG AAGGGCGGGA CAAAACCGAA TACTTCGGGG TGGCGGTCAG CCGCGACGGC AGGTGGCTGA CGCTCACCGC TTCGCCGGGC ACCGCGCCCC GCAACGACGC GTGGATCGCG GACCTCACCG TGTCCGCCCC GGAAGCGCCG CGCTTCACCG AGATCCAGAC CGGTGTGGAC GCTGAAACCT TCCCTCATGT GGGCCGCGAC GGCAGGCTGT ACCTCTTCAC CGACCTGGAC GCGCCGCGGG GACGGCTGTG CGTGGCCGAC CCGGCCTCGC CCACCCCGGA CCACTGGCAG ACCCTGATCG CCAGCGACCC GGACGCGGTC CTCAGCGGGT ATGCGATCCT CGACGGCGCG GAACTGGACA CTCCGGTGCT CCTCGCCCGG TGGGAGCGGC ACGCGCTCAG TGAACTCAGT GTGCACCACC TCGCCACCGG GGAGCGGATC CGGAACCTTC CCCTGCCCGG CCTCGGGTCG GTGGGCAGCA TCACCGCCCG CCCCGAAGGC GGCACGGAAG CGTGGTTCAC CTACACCGAC TACACGTCGC CGACCGCGGT CTACCGGTAC GACGCGCGCA CCCACGACGT GGTGCTCTGG CAGAAGTCCC CGGGAACAGT GGAGGCTCCC CCAGTCCGCA CCGAGCAGGT CACCTACACC TCCCGGGACG GCACCCCGGT GCGGATGCTG GTGGTCTCCC CGCCGGACCG GGAGGGGCCC CGTCCCGCCA TCCTGTACGG GTACGGCGGT TTCGGGATCT CGCTGACCCC GGCCTATTCG GCGTCGATCC TGGCGTGGGT GGCGGCCGGT GGCGCCTACG CGGTCGCCTC GCTCCGCGGC GGCCTCGAAG AAGGCGAAGA GTGGCATCGG GCAGGCATGC TCGGCAACAA GCAGAACGTT TTCGACGACT GCCATGCCGC TGCCGAATAC CTGGTCGACG CGGGGTTCAC CACTCGGGAG CAGCTTTCCG TGATGGGCGG CAGCAACGGA GGGCTGCTCG TCGGAGCCGC TATCACCCAG CGTCCCGAGT TGTATGCCGC GGCCGTGTGC TCCGCCCCGC TGCTGGACAT GGTGCGATAT GAACAATTCG GGCTGGGGCA GTTGTGGAGC GTGGAATACG GCAGCGCCAG CGACCCGGAA 
GCGTTGCAGT GGCTGCTCGC CTACTCCCCG TACCACAACG TCCGGGAAGG GGTGCGCTAT CCGGCGACCC TGTTCACGGT CTTCGAAAAC GACACCCGGG TCGACCCGTT GCACGCCCGC AAAATGTGTG CGGCGCTCCA ACACGCCACC TCGGCCGCTC CGGAGGAAGC CCCGATCCTG CTGCGCCGGG AGACTGACGT GGGGCACAGC ACGCGTTCCG TGAGCCGCAG CGTCCGGCTC GCCGCGGACC AGTTGGCTTT CCTCGCCCAT TACACCGGGT TGCGGGTCAC CGACCGTACT GGAGGGTAG
|
Protein sequence | MSRLPSYPFA ERWELTDVLH GQTVADPYRW LEQTDLAATK EWSQSQDALF TAASAMWSAT DWFRTRLRAL MNAGSVGAPV WRGQRRLFVR RAADQEHAVL YVQDGSGPER VLIDPTELNP EGTTTLDSWV VDWEGRLLAY QLSDNGDEQS RLWVMDIDTG ENIDGPIDRC SYSPVAWLPG RDAFYYVRRL APELVPEGEE QYHRRVYLHR VGTSPEEDTL IFGEGRDKTE YFGVAVSRDG RWLTLTASPG TAPRNDAWIA DLTVSAPEAP RFTEIQTGVD AETFPHVGRD GRLYLFTDLD APRGRLCVAD PASPTPDHWQ TLIASDPDAV LSGYAILDGA ELDTPVLLAR WERHALSELS VHHLATGERI RNLPLPGLGS VGSITARPEG GTEAWFTYTD YTSPTAVYRY DARTHDVVLW QKSPGTVEAP PVRTEQVTYT SRDGTPVRML VVSPPDREGP RPAILYGYGG FGISLTPAYS ASILAWVAAG GAYAVASLRG GLEEGEEWHR AGMLGNKQNV FDDCHAAAEY LVDAGFTTRE QLSVMGGSNG GLLVGAAITQ RPELYAAAVC SAPLLDMVRY EQFGLGQLWS VEYGSASDPE ALQWLLAYSP YHNVREGVRY PATLFTVFEN DTRVDPLHAR KMCAALQHAT SAAPEEAPIL LRRETDVGHS TRSVSRSVRL AADQLAFLAH YTGLRVTDRT GG
|
| |
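The Gene sequence and Protein sequence fields can be spot-checked against each other with a small script. The sketch below is an illustration, not the database's own tooling: it builds the codon assignments of translation table 11 (which are the same as the standard code), translates only the first 30 and the last 9 nucleotides copied from the Gene sequence field, and includes a GC helper. The names `translate` and `gc_percent` are hypothetical, and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

# Codon assignments for translation table 11, in NCBI TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def _clean(seq: str) -> str:
    """Drop the spaces used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    clean = _clean(seq)
    usable = len(clean) - len(clean) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[clean[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    clean = _clean(seq)
    return 100.0 * (clean.count("G") + clean.count("C")) / len(clean)


# First 30 and last 9 nucleotides, copied from the Gene sequence field.
HEAD = "ATGTCTCGTC TCCCTTCCTA TCCCTTTGCT"
TAIL = "GGAGGGTAG"

print(translate(HEAD))  # compare with the start of the Protein sequence field
print(translate(TAIL))  # compare with the end of it; the last codon is a stop
```

The two fragments translate to MSRLPSYPFA and GG followed by a stop, matching the start and end of the Protein sequence field. Reproducing the listed GC content would require passing the full Gene sequence to `gc_percent`.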