Gene Information |
Locus tag | Nther_2063 |
Symbol | |
ID | 6317129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2180863 |
End bp | 2181789 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642644451 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001918218 |
Protein GI | 188586673 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000000239523 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.000119694 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAAGAATT GGAACCAAAA CAAAAATAGA AATGAAAAAG AATTCGAATA TAAGGAGAAT AATCATTTAG ATAGGGATGA AGAGTTTGAA AATGAAGAAT TAGAAGATGA AGAATTTTAT TTTCCTGAAG AATACTGGGA CTTTGAATTT GATGAAGAAG ACGATGAGGA CTGGACTGAG TTTGATGAAA GAAAAAGCAA GATTCGAAAA ATATTTACAG CGGCTGTGTT AGTGATGTTT GTTATTACAG CTATGACCGG AGTATTTAAT GTTTTAGCTA ATTTTCCTAT TGATGCGTAT TTAGAATCTT TGGATTTGAG AGATAACCCC CAAGTAAAAG AACTGAAAAA AAGTGTTGTT ATGGTATCAG GACATGGAGA AGCCCAAAAA AACTCTATTT CTACAAGACA GGCGGGCTCT GGTTTTAATA TTGATCCTTC GGGTAAAATA CTGACTAATC GTCATGTTGT TGAAGACGCT ACTAATATTT CCGTGAATTT CAGAGAAGAA GAAAAGGGAT TTCCTGTTGA GGAATGGCAC GGTGCTCCTT ACCCAAATAT CGATATGGCT ATTTTAGAAA TTCAGGGAGA AAATTTACCC TATGTTGAGC TTAAGGATGA TCCGATCGCA TCTTTAGACA AAGGTCAAGA TGTTTTGATT ATTGGTAATC CCAGGGGGAT AGGGAGCCTG GCTGTAGAAG GTGAACTCAT GAAAATTCAC GAACTTTCCG GGACACCTCA TAGTATTTTA GAAATAGATG CCCATATTCA TCCTGGACAT AGTGGTAGCC CTGTATTTGA TGCAGAAGGT GAAGTAGTGG GAATTATATA CGCTTCGCGT GAAACAAATG ATGGTAAACA AGTAGGTTTA GCAGTTTCTT TAAAAGACGT GAAAGATCTA GAAAAATTTA AGGATAGAGG GGAATAA
Protein sequence | MKNWNQNKNR NEKEFEYKEN NHLDRDEEFE NEELEDEEFY FPEEYWDFEF DEEDDEDWTE FDERKSKIRK IFTAAVLVMF VITAMTGVFN VLANFPIDAY LESLDLRDNP QVKELKKSVV MVSGHGEAQK NSISTRQAGS GFNIDPSGKI LTNRHVVEDA TNISVNFREE EKGFPVEEWH GAPYPNIDMA ILEIQGENLP YVELKDDPIA SLDKGQDVLI IGNPRGIGSL AVEGELMKIH ELSGTPHSIL EIDAHIHPGH SGSPVFDAEG EVVGIIYASR ETNDGKQVGL AVSLKDVKDL EKFKDRGE
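
The coordinate, length, and GC fields in this record are related by simple arithmetic, so they can be cross-checked. The following is a minimal sketch, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a sequence; whitespace (e.g. 10-base blocks) is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(2180863, 2181789)
print(length_bp)                  # 927, matching the Gene Length field
print(protein_length(length_bp))  # 308, matching the Protein Length field
```

Passing the Gene sequence string from the record to `gc_percent` gives a value that can be compared with the reported GC content. The strand field does not affect any of these checks, since length and GC fraction are the same on either strand.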