Gene Information |
Locus tag | Nther_1742 |
Symbol | |
ID | 6317223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1807776 |
End bp | 1809164 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642644118 |
Product | protein of unknown function DUF107 |
Protein accession | YP_001917904 |
Protein GI | 188586359 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1030] Membrane-bound serine protease (ClpP class) |
TIGRFAM ID | |
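The coordinates and lengths in the table above are mutually consistent, which a little arithmetic can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the database record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length_bp(1807776, 1809164)
print(length, protein_length_aa(length))
```

With the values in this record, the sketch gives 1389 bp and 462 aa, matching the Gene Length and Protein Length rows.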
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0102804 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTCT GCTCCAAAAA AAACCTTAGG TTTTTAAAAG TAACATCTAT TTTGGTAATC TTGGTTTTTT TATTAAGTGC AATACAAATG CCTGTAAGTA GTGAAGAAAA CAATTCTGAC CCCAAATATA AACTATTAAC TGTTGAGGAC ACAATTACAG CAGGCACTAG TCAATATCTG GAGCAAGGAA TTGAAAATGC CATAAACCAA GGATATGACG GGGTGATTAT AGTCCTAAAT ACTCCCGGGG GGCTTGTTGA TGCCACCTTA GATATTATGG GTAAAATAGT TAATTCTCCT ATACCTGTAA TCACCTTTGT AAGTCCTTCA GGAGCCATAG CTGCTTCGGC TGGTACTTTT ATTTTAGTTA GTGGGCATGT GGCAGCTATG ACACCGGGAA GCACTTGTGG CGCTGCTATG CCAGTGACTA TGCAGCCGGG GGAAGAGGGG ACACAAGAAG CGGATCAAAA AACCATTAAT TTTTTGGCCG GTCACTTAAA AAGTGTTGCC AGGGAACAGG GTCGACCTGA AGAAGTTGTC GAGAAATTTG TAACTGAAAA TTTAACTCTG AATGCATCGG AAGCTTTGGA GAAGAATGTA ATTGAATTCA ATGAACCAAA TTTAGATGCT TTATTAACCG CTATCCATGG TCATGAAGTT ACTGTAGCAG GTGAGGAAAT CACCTTAGAA ACTGAAAACG CCAGGATAGA TGAAGCTGAA ATGACTTTTA CCCAACAGTT GAGTCACTTT ATCAGCAATC CTCAAATAGC ATTAATACTA TTTATGATCG GTATTTATGG AATTATTTTT GGTATCAATA TGCCTGGGAC AATTATACCA GAACTTGGGG GAGTCCTTTC ACTAATTTTG GCTTTATTTG GTCTAGGTAT GTTTGAAGCA AATACCCTTG GCATTATCTT AATTGTATTA TCAGTAATTT TATTTATAGC TGAGGTATTC ACACCAACTT TCGGTATATT AACTACCGTT GGTGTTATTG CATTAGTCAT AGGTGGCTTC TTCTTGCCAG TTGAACCCAT GCTCCCTCAA GCATGGTTTG ATGCTTTTCA AATGACTGTC ATAGGGATGG CTTTGGTAAC TGCAGGTTAT CTGGCTTTGG TTATAATGAA ACTACTTAAA ATACGTAAAC AGTCTGCTGT ACACAAAAAG CATGGAATGA TCGGTTATCG TGGCAGGACA ACAGAAGATT TGAATCCAGA AGGTTATATT AAAATTCGCG GTGAATTGTG GCGAGCCAAA AGCAAAGATG AGGAATTTAT AGCGAAGAAT AGAGATGTAA TAGTAGAACA AGTTGAGGGG ATTAAACTGA TGGTATCTGA AACAAAAAGG AGTGACCAAG AGGAGACGGA AAAAGAAGTT GAGGAGTAA
Protein sequence | MDVCSKKNLR FLKVTSILVI LVFLLSAIQM PVSSEENNSD PKYKLLTVED TITAGTSQYL EQGIENAINQ GYDGVIIVLN TPGGLVDATL DIMGKIVNSP IPVITFVSPS GAIAASAGTF ILVSGHVAAM TPGSTCGAAM PVTMQPGEEG TQEADQKTIN FLAGHLKSVA REQGRPEEVV EKFVTENLTL NASEALEKNV IEFNEPNLDA LLTAIHGHEV TVAGEEITLE TENARIDEAE MTFTQQLSHF ISNPQIALIL FMIGIYGIIF GINMPGTIIP ELGGVLSLIL ALFGLGMFEA NTLGIILIVL SVILFIAEVF TPTFGILTTV GVIALVIGGF FLPVEPMLPQ AWFDAFQMTV IGMALVTAGY LALVIMKLLK IRKQSAVHKK HGMIGYRGRT TEDLNPEGYI KIRGELWRAK SKDEEFIAKN RDVIVEQVEG IKLMVSETKR SDQEETEKEV EE
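The two sequence rows above are printed in space-separated blocks of ten characters. This is a minimal sketch of how they relate, assuming the gene sequence is shown 5' to 3' in coding orientation (it begins with ATG and ends with TAA) and that translation table 11 uses the standard codon-to-amino-acid assignments. The helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
# Assumption: table 11 shares these assignments and differs only in start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence row above.
print(translate("ATGGACGTCT GCTCCAAAAA AAACCTTAGG"))
```

The first 30 bases translate to MDVCSKKNLR, the first block of the protein sequence row. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content row, allowing for rounding.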