Gene Information |
Locus tag | Nther_1967 |
Symbol | |
ID | 6315914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2074244 |
End bp | 2076256 |
Gene Length | 2013 bp |
Protein Length | 670 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642644357 |
Product | Prolyl oligopeptidase |
Protein accession | YP_001918125 |
Protein GI | 188586580 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00433613 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.203518 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCTA AAAATTCTCG GATTGTAGTG GAAGATTTTC ATGGTACTAA GGTATATGAT CCCTACAGGT GGCTAGAGGA TGAGAAAGCC CCTGAGGTTA AAGAGTGGAG GGAAAGAGAA CAGGAACAGA CCAAAAATTT TTTAGAAGGT GATTTAAAAA CTAAAGTTAA ACAACGGTTA GAAAAACTGT ACAGTTTTCC ACAACTATAT ATTCCTGTAA AAAAAGGACA GCGCTATTTT TATCAATATC ATGATGGCTT GCAAAATCAG CCGGTCTTGT ATTTTAGAGA AGCAAATGAG GATAAAGAAA AGCTATTGGT AGACCCCAAT AAATTTAGTG ATGACGGTAC TACAGCTATA ACAGTTTTTT TCCCTAGTGA TGACGGTAAA TTACTAGCTT ATTCTCTATC ACGAAAAGGA AGTGACTGGC AGGAAATATA TATAATAAAT GTAGAAACTG GAGAAAAATA TCCTGAAACT ATCCAATATT GCAGATTCAC TAATGTAGCA TGGTCCAAAG ATAATCTGGG TTTTTATTAT AGTAGATTTC CAAACCCTGA AGGGGTTGCA GAAGAAGATC AAAATAAATA CAACAAAGTC TATTACCATA AAATTGGCAG AGACCAGTCT AATGATGAGT TAATTTATGA AGACAATCAT GACAAAGAAC TGGTATTTAA TCCTTTTCTC ACTCATGACG GCGAATATAT ATGCTTGTTC GTCCGCAAAG GAACAGATCC TCGAAATGGT TTTTATATAA AAAAAGCTGA TTCTGAAGAT AACTTTACAA AGTTATTCCC TCAAGGTGAA GCAATGTATA AGCCTATGGG GATAATTGGG AACACTTTTT ATTTTTTAAG TGATAAACAA GCACCTAAAG GAAAGATTAT AGCTGTTGAT TTAAACAACC AAACCCAGAA AACTGTAATC GCAGAAACAG ATAAAATAAT CTCCGATGCG GCTGTAATTA ACAATCATCT AGTTTTAGTT TATCAGGATC ATGGAAGCCA TCTCGTAAAT ATTTTCAATT TAGACGGAGT TAAAGTTGAT CAGATAACAT TAGCTGATTA TGCTTCAATT TCTGGGTTGT CGGGGCAGCC TAATGACCCG GAAATGTTTA TCGCATATAA TACATTATTG CGTCCTACGA CTATTTTGAG ATATACTTTT GATGGTGAAT CGGAAATTTA TAAAACTCCA GAAATCTCCT ATGAGCTAAG GGATTTTGAA AGTAAGCAAA TATTTTATGA GTCGAAAGAT GGAACTCAAG TTCCAATGTT TTTGATATAT AAAAAAGGAT TGGAATTGAA TGGTAATAAT CCGGCCTTGA TATTTGGCTA CGGTGGTTTT AAGATAAGCA TGAACCCTAG ATTTTCGCCC GCTAATATAA AATGGATCGA AGAGGGTGGA ATATTTGCTA TAGCTTGTAT TAGGGGTGGA AATGAATATG GAGAAGACTG GCACAGACAG GGAATGCTAT TGAATAAACA AAATGTATTT GACGACTTTA TAGCTGCTGG GGAATGGTTG ATTGACAATA ATTATACCCG CAAAGACAAA CTAGCTATTA CAGGTCGCAG TAATGGTGGC TTATTAGTAG CTGCATGTAT GACCCAACGA CCAGATTTAT ACGGTGCAGT AGTTTGTGGG GTACCAGTTA TTGATATGCT GCGATTCCAT AAGTTTACCA TTGGACGCTA CTGGATCCCT GAATACGGTG ATCCTGATAA TGATCCTCAG GCTTTTGAAA ATTTATATAG TTATTCACCT CTCCATAATA TATCTAAGGG TGAAGTTTAT 
CCACATACCC TTGTTTTAAC TGCTGATACT GATGATAGAG TTGTCCCAGC TCATGCTTTA AAATTTGTTC GGGCACTTAA GGATAATGCA AAGAATAATC AGGATATTTT CCTAAGAATG GAGAAAAAAG CTGGTCATGG ATTAGGGAAA CCTATAGGAA AAAGAATTGA AGAAGATGCT GACTGGTTAA GCTTTTTATT AAAAGTTCTT TAA
|
Protein sequence | MSSKNSRIVV EDFHGTKVYD PYRWLEDEKA PEVKEWRERE QEQTKNFLEG DLKTKVKQRL EKLYSFPQLY IPVKKGQRYF YQYHDGLQNQ PVLYFREANE DKEKLLVDPN KFSDDGTTAI TVFFPSDDGK LLAYSLSRKG SDWQEIYIIN VETGEKYPET IQYCRFTNVA WSKDNLGFYY SRFPNPEGVA EEDQNKYNKV YYHKIGRDQS NDELIYEDNH DKELVFNPFL THDGEYICLF VRKGTDPRNG FYIKKADSED NFTKLFPQGE AMYKPMGIIG NTFYFLSDKQ APKGKIIAVD LNNQTQKTVI AETDKIISDA AVINNHLVLV YQDHGSHLVN IFNLDGVKVD QITLADYASI SGLSGQPNDP EMFIAYNTLL RPTTILRYTF DGESEIYKTP EISYELRDFE SKQIFYESKD GTQVPMFLIY KKGLELNGNN PALIFGYGGF KISMNPRFSP ANIKWIEEGG IFAIACIRGG NEYGEDWHRQ GMLLNKQNVF DDFIAAGEWL IDNNYTRKDK LAITGRSNGG LLVAACMTQR PDLYGAVVCG VPVIDMLRFH KFTIGRYWIP EYGDPDNDPQ AFENLYSYSP LHNISKGEVY PHTLVLTADT DDRVVPAHAL KFVRALKDNA KNNQDIFLRM EKKAGHGLGK PIGKRIEEDA DWLSFLLKVL
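The coordinates and lengths reported under Gene Information can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene sequence includes its terminal stop codon (the sequence above ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 2074244
END_BP = 2076256
REPORTED_GENE_LENGTH = 2013      # bp
REPORTED_PROTEIN_LENGTH = 670    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_len bp.

    Assumes the CDS is a whole number of codons; the stop codon, if
    present in the sequence, does not contribute an amino acid.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length == REPORTED_GENE_LENGTH)                     # True
    print(protein_length(length) == REPORTED_PROTEIN_LENGTH)  # True
```

Under these assumptions, 2076256 - 2074244 + 1 gives 2013 bp, and 2013 / 3 - 1 gives 670 aa, matching the reported Gene Length and Protein Length.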