Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1177 |
Symbol | |
ID | 6315311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 1247241 |
End bp | 1248317 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642643550 |
Product | germination protease |
Protein accession | YP_001917348 |
Protein GI | 188585803 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01441] GPR endopeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0671505 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.000569183 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCATG ATAAGAAAGA TATGAATTCT AATAAGGAAG GTTTAAATTC AAGTATCCGA ACAGACTTGG CAATTGAAGC TATGGAAATG GTATCTCCTG AAACTGAGAA AGATATTCCA GGAGTAGAAA GTGAAACTTT TCAAGAGGAT GGAATAACAG TTAACCATCT CAGTATCACG TCACCACAAG GTCAGCAAGC TATGAATAAA GCCATGGGGA ATTATGTTAA TATTGAAGCT CCGGGCCTGC GGGAAAAAAA TACAGATCTA CAAGAAGAGG TTAGCCAGAT TGTAGCGAGA GAATTACAAA ATATTGCCCA ATTTAATGAT AGTACTGAAT TGATGGTAGT AGGATTGGGG AATTGGAATG TAACTCCCGA TTCTATTGGC CCTAAAGTAG TTGAAGATCT AGTTATCACA AGACATTTAA AACAATTAGT ACCAGAACAA TTAGGTGAAG GTTTTCGATC TATATCAGGG GTTGCACCTG GAGTCATGGG TTTAACTGGA GTAGAAACAG GTGAAATAAT CAAAGGAATT GTAGAAGAAG CTAAACCTAA TATGATATTG GCCATTGATG CTTTAGCAGC TCGAAACCTT AGTAGATTAA ACACAACAGT TCAAATTGCT GATAACGGCA TTCACCCTGG GTCCGGGGTA GGTAACGATC GAATGGGAAT AAATAAAGAG ACTATGGGGG TTCCAGTTGT TGCTATGGGA GTACCCACTG TAGTTGATGC AACTACCTTG GTAGGTGATA CTTTACAAAT GGTTAACAAT CAAGGTCAGC AAGGTCAAAA TGCTCAACAA GGACAGCAAC CAACCCAAAC TCAATCACAA GGTATGCCAG GACCAGGCAA TCCAGCTCAA GGATCTCCTG GCGGCAATCA ACAAGTTGAT CGTAATTTAG TCAATCAAGC ACTTCAACCT TATAGTGGTG AAGGACGAAC CTTGATGATT ACTCCTAAGG AAGTAGATCA GTTTGTAGAT GATATTTCTG AAGTTTTGGC TGGTGGAATA AATGTGGCAG TTCATCCTCG GGTTGCCCAG GAAAATCCCG GAAAGTATTT ACAATAA
|
Protein sequence | MSHDKKDMNS NKEGLNSSIR TDLAIEAMEM VSPETEKDIP GVESETFQED GITVNHLSIT SPQGQQAMNK AMGNYVNIEA PGLREKNTDL QEEVSQIVAR ELQNIAQFND STELMVVGLG NWNVTPDSIG PKVVEDLVIT RHLKQLVPEQ LGEGFRSISG VAPGVMGLTG VETGEIIKGI VEEAKPNMIL AIDALAARNL SRLNTTVQIA DNGIHPGSGV GNDRMGINKE TMGVPVVAMG VPTVVDATTL VGDTLQMVNN QGQQGQNAQQ GQQPTQTQSQ GMPGPGNPAQ GSPGGNQQVD RNLVNQALQP YSGEGRTLMI TPKEVDQFVD DISEVLAGGI NVAVHPRVAQ ENPGKYLQ
|
| |