Gene Nther_1967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1967 
Symbol 
ID6315914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2074244 
End bp2076256 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content35% 
IMG OID642644357 
ProductProlyl oligopeptidase 
Protein accessionYP_001918125 
Protein GI188586580 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00433613 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.203518 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCTA AAAATTCTCG GATTGTAGTG GAAGATTTTC ATGGTACTAA GGTATATGAT 
CCCTACAGGT GGCTAGAGGA TGAGAAAGCC CCTGAGGTTA AAGAGTGGAG GGAAAGAGAA
CAGGAACAGA CCAAAAATTT TTTAGAAGGT GATTTAAAAA CTAAAGTTAA ACAACGGTTA
GAAAAACTGT ACAGTTTTCC ACAACTATAT ATTCCTGTAA AAAAAGGACA GCGCTATTTT
TATCAATATC ATGATGGCTT GCAAAATCAG CCGGTCTTGT ATTTTAGAGA AGCAAATGAG
GATAAAGAAA AGCTATTGGT AGACCCCAAT AAATTTAGTG ATGACGGTAC TACAGCTATA
ACAGTTTTTT TCCCTAGTGA TGACGGTAAA TTACTAGCTT ATTCTCTATC ACGAAAAGGA
AGTGACTGGC AGGAAATATA TATAATAAAT GTAGAAACTG GAGAAAAATA TCCTGAAACT
ATCCAATATT GCAGATTCAC TAATGTAGCA TGGTCCAAAG ATAATCTGGG TTTTTATTAT
AGTAGATTTC CAAACCCTGA AGGGGTTGCA GAAGAAGATC AAAATAAATA CAACAAAGTC
TATTACCATA AAATTGGCAG AGACCAGTCT AATGATGAGT TAATTTATGA AGACAATCAT
GACAAAGAAC TGGTATTTAA TCCTTTTCTC ACTCATGACG GCGAATATAT ATGCTTGTTC
GTCCGCAAAG GAACAGATCC TCGAAATGGT TTTTATATAA AAAAAGCTGA TTCTGAAGAT
AACTTTACAA AGTTATTCCC TCAAGGTGAA GCAATGTATA AGCCTATGGG GATAATTGGG
AACACTTTTT ATTTTTTAAG TGATAAACAA GCACCTAAAG GAAAGATTAT AGCTGTTGAT
TTAAACAACC AAACCCAGAA AACTGTAATC GCAGAAACAG ATAAAATAAT CTCCGATGCG
GCTGTAATTA ACAATCATCT AGTTTTAGTT TATCAGGATC ATGGAAGCCA TCTCGTAAAT
ATTTTCAATT TAGACGGAGT TAAAGTTGAT CAGATAACAT TAGCTGATTA TGCTTCAATT
TCTGGGTTGT CGGGGCAGCC TAATGACCCG GAAATGTTTA TCGCATATAA TACATTATTG
CGTCCTACGA CTATTTTGAG ATATACTTTT GATGGTGAAT CGGAAATTTA TAAAACTCCA
GAAATCTCCT ATGAGCTAAG GGATTTTGAA AGTAAGCAAA TATTTTATGA GTCGAAAGAT
GGAACTCAAG TTCCAATGTT TTTGATATAT AAAAAAGGAT TGGAATTGAA TGGTAATAAT
CCGGCCTTGA TATTTGGCTA CGGTGGTTTT AAGATAAGCA TGAACCCTAG ATTTTCGCCC
GCTAATATAA AATGGATCGA AGAGGGTGGA ATATTTGCTA TAGCTTGTAT TAGGGGTGGA
AATGAATATG GAGAAGACTG GCACAGACAG GGAATGCTAT TGAATAAACA AAATGTATTT
GACGACTTTA TAGCTGCTGG GGAATGGTTG ATTGACAATA ATTATACCCG CAAAGACAAA
CTAGCTATTA CAGGTCGCAG TAATGGTGGC TTATTAGTAG CTGCATGTAT GACCCAACGA
CCAGATTTAT ACGGTGCAGT AGTTTGTGGG GTACCAGTTA TTGATATGCT GCGATTCCAT
AAGTTTACCA TTGGACGCTA CTGGATCCCT GAATACGGTG ATCCTGATAA TGATCCTCAG
GCTTTTGAAA ATTTATATAG TTATTCACCT CTCCATAATA TATCTAAGGG TGAAGTTTAT
CCACATACCC TTGTTTTAAC TGCTGATACT GATGATAGAG TTGTCCCAGC TCATGCTTTA
AAATTTGTTC GGGCACTTAA GGATAATGCA AAGAATAATC AGGATATTTT CCTAAGAATG
GAGAAAAAAG CTGGTCATGG ATTAGGGAAA CCTATAGGAA AAAGAATTGA AGAAGATGCT
GACTGGTTAA GCTTTTTATT AAAAGTTCTT TAA
 
Protein sequence
MSSKNSRIVV EDFHGTKVYD PYRWLEDEKA PEVKEWRERE QEQTKNFLEG DLKTKVKQRL 
EKLYSFPQLY IPVKKGQRYF YQYHDGLQNQ PVLYFREANE DKEKLLVDPN KFSDDGTTAI
TVFFPSDDGK LLAYSLSRKG SDWQEIYIIN VETGEKYPET IQYCRFTNVA WSKDNLGFYY
SRFPNPEGVA EEDQNKYNKV YYHKIGRDQS NDELIYEDNH DKELVFNPFL THDGEYICLF
VRKGTDPRNG FYIKKADSED NFTKLFPQGE AMYKPMGIIG NTFYFLSDKQ APKGKIIAVD
LNNQTQKTVI AETDKIISDA AVINNHLVLV YQDHGSHLVN IFNLDGVKVD QITLADYASI
SGLSGQPNDP EMFIAYNTLL RPTTILRYTF DGESEIYKTP EISYELRDFE SKQIFYESKD
GTQVPMFLIY KKGLELNGNN PALIFGYGGF KISMNPRFSP ANIKWIEEGG IFAIACIRGG
NEYGEDWHRQ GMLLNKQNVF DDFIAAGEWL IDNNYTRKDK LAITGRSNGG LLVAACMTQR
PDLYGAVVCG VPVIDMLRFH KFTIGRYWIP EYGDPDNDPQ AFENLYSYSP LHNISKGEVY
PHTLVLTADT DDRVVPAHAL KFVRALKDNA KNNQDIFLRM EKKAGHGLGK PIGKRIEEDA
DWLSFLLKVL