Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1034 |
Symbol | |
ID | 6315619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1100262 |
End bp | 1101470 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642643406 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_001917206 |
Protein GI | 188585661 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000872071 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGATC CTGTACTAAG TATCGATGTT TCAAAAGAAA ATAGTACTGC TGCTCTGTTT CTTTCTCAAG GTAATTTGAA AGACAAAACC TTCACCTTTA AGCACACCCA TAAGGAGTTA TCTGAAGTTT TGGAGATCCT TAAGTCTACT GAAATAGAAA CTGGTTCTAA AGCTAAAGTT GTTCTAGAGG CCACCGGAAA CTACTCCACC CCTATTGTCA GTTTTTTTGA AACTAATGGA TTTAAAGTTA TATCATTGAA TCCCATTGAA ACCCACTTAC AAAAAAATAA AGCTGTAAGA AAGGTTAAAA CTGATGCTAT TGATGTCGTT AGAATTGCCA ATGTTTATTA TCTTAAAGGA GATCAATTAC AGTTTAGGAT GGATGATCAG GCTAGAAATC TACGAACTAT GTGTAGACAG TATGATGGTA TTTGTGAGGT GTTTAGCGAA ACGCAGCTTA AATTTAGAGA CTTTGTTGAA ATGGTCTTTC CTATGTATAA TGGGATTTTT TCTGATTTTT GTTCTAAAAC TTCTCTAAAT GTACTTTATC ATTTCCCTTC TCCAAATGCT GTACTTGAAG CTTCTAGAGA AGAGATAATC AAAGCTTTGA AATTGGCTAA CATGCCTAAA AAGTGGTACC AAGATAAGGT GGACATGCTT TACAATGCTG CCAGAGAAAG CCTATCTGTT AACCTGTCCC AAGGGCCTTT TGAAGAAGCT ATCAAGGATT ATATTTCTAT CCTCAATAAC TTAAGAGACA ATTTGACCCG TATGCGGGAT CGAATGGTTA AGCTTGCTAG GCTTTCACCT CAATTTGAGC TCCTCCTCTC CATCCCAGGG GTAGGAGAAG TGACAGCTGC CACCATTCTT TCTGAAATCG GTGACGTCCT TAACTTCCCT ACTGTTAAAC AACTAGTCGC TTTTTCAGGA CTAGATCCCA GCGTTTTTCA GTCAGGAAGG TTTAAGGCTA CTAAAAACAA GATATCTAAA AGAGGATCGA ATCATCTTAG AAAAGCTCTT TATCAGGCTA CTGTTGCAGG TATTAGAAAG AGAAAAGGAA AGCCTGTAAA CCCCGTGATA TTTGAGTTCT ATTCTAAAAA GCTTTCCGAG GGTAAGGCGC CTAATGTTGC AATTATTGCA GCTTCTAACA AGCTATTAAG AATAATTTAT GGCATGTTGA AGAGTAGAAC CATTTTCTCG ACCTCTTGA
|
Protein sequence | MTDPVLSIDV SKENSTAALF LSQGNLKDKT FTFKHTHKEL SEVLEILKST EIETGSKAKV VLEATGNYST PIVSFFETNG FKVISLNPIE THLQKNKAVR KVKTDAIDVV RIANVYYLKG DQLQFRMDDQ ARNLRTMCRQ YDGICEVFSE TQLKFRDFVE MVFPMYNGIF SDFCSKTSLN VLYHFPSPNA VLEASREEII KALKLANMPK KWYQDKVDML YNAARESLSV NLSQGPFEEA IKDYISILNN LRDNLTRMRD RMVKLARLSP QFELLLSIPG VGEVTAATIL SEIGDVLNFP TVKQLVAFSG LDPSVFQSGR FKATKNKISK RGSNHLRKAL YQATVAGIRK RKGKPVNPVI FEFYSKKLSE GKAPNVAIIA ASNKLLRIIY GMLKSRTIFS TS
|
| |