Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1154 |
Symbol | |
ID | 6315720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 1220591 |
End bp | 1222051 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 642643527 |
Product | transposase IS4 family protein |
Protein accession | YP_001917325 |
Protein GI | 188585780 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.543034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.461467 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGGTAG TAAATGACTC AATTGATGAA CTAGATGATA ATTTATTCTT AAAATATTAT CCAGGTGGAG GGAGAAGCAG TTATCATCCA AAAATGATGA CTAAAATTTT AGTATATGCT TATACACAAA AGATATATAC TTCACGTCAA ATAGCCAAAG CAGTTAGAGA GCAACTGCCT TTCATGTGGA TAGCAGCACG ACAAAAACCT GATTTTAGAA CAATTAATAG GTTTAGATCA GAGCGGATGA AATACGTAAT AGACGAAGTG TTTGCATCAG TGCTTGAACT TTTGATAAAA GAAGGTTACG TAAAATTTGA AAATTACTTT TTAGATGGCA CTAAAGTAGA AGCTAATGCA AACCGTTACA GTTTTGTATG GAAGAAGTCC ACAGACCGCT ATGAAGCTAA TCTTCAAGCT AAAATAAAAG AATTATTACA AGAGATAGAA GAGGAAAATG AGCGAGAAAA TGAAATATAT GGTGATAAAG ATTTAGATGA ATTGGGAGAA GACAGTCAAA TAACAAGTGA AGATCTGGAG AAAACCGTAG AAAAACTAGA ATCACGTTTA AAAGAAGAAC CAAAAAACAA AAAAGTGAAG AAAGCAGTAA AAACAATAAA AAAAGACTAC TTACCCCGGA CACGAAAGTA TGAAAAATAT CAGTCAACTT TTAATGGCCG AAACAGTTTT TCAAAAACAG ATAAAGATGC TACTTTTATG CGCATGAAAG AAGATCACAT GAAAAATGGT CAGTTAAAAC CAGGATACAA CATTCAGCTG GGAACAGAAA ACCAATTCAT TTTAGGATAT AGTATTCACC AGAAACCAAC TGACACAACT TGTTTAATTC CGCATTTAGA AAAACTTGAA GAACAGCTTG GTAATGTTCC TCAAAACATA ATTGCTGATG CTGGTTACGG AAGTGAAGAA AACTATAGAT ATTTAGAAGA AAAAGATAGG AATGCTTACG TTAAGTATAA TACATATTTC CAGGAACAAA AACGTAGTTG GAGAAAGAAA ATATTCCGAA GAGAAAATAT GCATTATGAT GCTAAAAATG ATAAATTCAT CTGTCCAAAT GGGAAAGAGT TACACTTTCA ATATGAAAAA AGTTACAAAA CTGAAGCTGG TTATATTACA AAACGGCGAC TGTATAGATG TTTTGATTGC CAAGAATGTG AGTTAAAAGA AAAGTGTACT AAGTCTAAAA AAGGTAGGAC TGTTTGGGTC AATTGGGAAT TAGAAAGTTA CAAACAAAAA GCTAGAGAAA ACCTTGGAAC TGATCATGGA AGAGAGCTCT CTTCTCAAAG GAAAATTGAT GTGGAGAGTG TAAATGGTCA TTTCAAGGCT AATCGTATGT TTAGGCGATT TATGCTCCGT GGGCTAGATA AAGTTAATAT CGAACTTGGA TTAATCAGTT TAGCACATAA TATGATTAAG AAGGCAGCCA TAGGATTCTA G
|
Protein sequence | MRVVNDSIDE LDDNLFLKYY PGGGRSSYHP KMMTKILVYA YTQKIYTSRQ IAKAVREQLP FMWIAARQKP DFRTINRFRS ERMKYVIDEV FASVLELLIK EGYVKFENYF LDGTKVEANA NRYSFVWKKS TDRYEANLQA KIKELLQEIE EENERENEIY GDKDLDELGE DSQITSEDLE KTVEKLESRL KEEPKNKKVK KAVKTIKKDY LPRTRKYEKY QSTFNGRNSF SKTDKDATFM RMKEDHMKNG QLKPGYNIQL GTENQFILGY SIHQKPTDTT CLIPHLEKLE EQLGNVPQNI IADAGYGSEE NYRYLEEKDR NAYVKYNTYF QEQKRSWRKK IFRRENMHYD AKNDKFICPN GKELHFQYEK SYKTEAGYIT KRRLYRCFDC QECELKEKCT KSKKGRTVWV NWELESYKQK ARENLGTDHG RELSSQRKID VESVNGHFKA NRMFRRFMLR GLDKVNIELG LISLAHNMIK KAAIGF
|
| |