Gene Nther_1034 details


Gene Information

Locus tag: Nther_1034
Symbol:
ID: 6315619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 1100262
End bp: 1101470
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 37%
IMG OID: 642643406
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_001917206
Protein GI: 188585661
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
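
The start, end, gene length, and protein length listed above agree with one another if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. The record does not state its coordinate convention, so that reading is an assumption. The helper names `gene_length_bp` and `protein_length_aa` are hypothetical and used only for this sketch.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1100262, 1101470)
print(length)                     # 1209
print(protein_length_aa(length))  # 402
```

Both printed values match the Gene Length and Protein Length fields of the record.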


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.000872071
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGATC CTGTACTAAG TATCGATGTT TCAAAAGAAA ATAGTACTGC TGCTCTGTTT 
CTTTCTCAAG GTAATTTGAA AGACAAAACC TTCACCTTTA AGCACACCCA TAAGGAGTTA
TCTGAAGTTT TGGAGATCCT TAAGTCTACT GAAATAGAAA CTGGTTCTAA AGCTAAAGTT
GTTCTAGAGG CCACCGGAAA CTACTCCACC CCTATTGTCA GTTTTTTTGA AACTAATGGA
TTTAAAGTTA TATCATTGAA TCCCATTGAA ACCCACTTAC AAAAAAATAA AGCTGTAAGA
AAGGTTAAAA CTGATGCTAT TGATGTCGTT AGAATTGCCA ATGTTTATTA TCTTAAAGGA
GATCAATTAC AGTTTAGGAT GGATGATCAG GCTAGAAATC TACGAACTAT GTGTAGACAG
TATGATGGTA TTTGTGAGGT GTTTAGCGAA ACGCAGCTTA AATTTAGAGA CTTTGTTGAA
ATGGTCTTTC CTATGTATAA TGGGATTTTT TCTGATTTTT GTTCTAAAAC TTCTCTAAAT
GTACTTTATC ATTTCCCTTC TCCAAATGCT GTACTTGAAG CTTCTAGAGA AGAGATAATC
AAAGCTTTGA AATTGGCTAA CATGCCTAAA AAGTGGTACC AAGATAAGGT GGACATGCTT
TACAATGCTG CCAGAGAAAG CCTATCTGTT AACCTGTCCC AAGGGCCTTT TGAAGAAGCT
ATCAAGGATT ATATTTCTAT CCTCAATAAC TTAAGAGACA ATTTGACCCG TATGCGGGAT
CGAATGGTTA AGCTTGCTAG GCTTTCACCT CAATTTGAGC TCCTCCTCTC CATCCCAGGG
GTAGGAGAAG TGACAGCTGC CACCATTCTT TCTGAAATCG GTGACGTCCT TAACTTCCCT
ACTGTTAAAC AACTAGTCGC TTTTTCAGGA CTAGATCCCA GCGTTTTTCA GTCAGGAAGG
TTTAAGGCTA CTAAAAACAA GATATCTAAA AGAGGATCGA ATCATCTTAG AAAAGCTCTT
TATCAGGCTA CTGTTGCAGG TATTAGAAAG AGAAAAGGAA AGCCTGTAAA CCCCGTGATA
TTTGAGTTCT ATTCTAAAAA GCTTTCCGAG GGTAAGGCGC CTAATGTTGC AATTATTGCA
GCTTCTAACA AGCTATTAAG AATAATTTAT GGCATGTTGA AGAGTAGAAC CATTTTCTCG
ACCTCTTGA
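
A GC content figure such as the one in the Gene Information section can be recomputed from the gene sequence. The sketch below assumes the usual definition, (G + C) divided by the number of A, C, G, and T bases, since the record does not say how its value was derived. The function name `gc_content` is hypothetical. Spaces and line breaks from the 10-base layout are ignored.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # skips spaces and newlines
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# First two 10-base blocks of the gene sequence above.
print(round(gc_content("ATGACTGATC CTGTACTAAG") * 100))  # 40
```

Passing the full gene sequence instead of the two sample blocks gives the gene-wide value to compare against the listed percentage.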
 
Protein sequence
MTDPVLSIDV SKENSTAALF LSQGNLKDKT FTFKHTHKEL SEVLEILKST EIETGSKAKV 
VLEATGNYST PIVSFFETNG FKVISLNPIE THLQKNKAVR KVKTDAIDVV RIANVYYLKG
DQLQFRMDDQ ARNLRTMCRQ YDGICEVFSE TQLKFRDFVE MVFPMYNGIF SDFCSKTSLN
VLYHFPSPNA VLEASREEII KALKLANMPK KWYQDKVDML YNAARESLSV NLSQGPFEEA
IKDYISILNN LRDNLTRMRD RMVKLARLSP QFELLLSIPG VGEVTAATIL SEIGDVLNFP
TVKQLVAFSG LDPSVFQSGR FKATKNKISK RGSNHLRKAL YQATVAGIRK RKGKPVNPVI
FEFYSKKLSE GKAPNVAIIA ASNKLLRIIY GMLKSRTIFS TS
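
The protein sequence can be checked against the gene sequence by translating the codons directly. The following is a minimal sketch and not the database's own pipeline. It builds the codon table in the standard NCBI ordering (bases T, C, A, G), whose codon-to-amino-acid assignments translation table 11 shares with the standard code. Table 11 differs in which codons may act as start codons, and this sketch does not handle an alternative start codon, which is harmless here because the gene begins with ATG. The function name `translate_cds` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence; whitespace is ignored, a final stop is dropped."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGACTGATC CTGTACTAAG TATCGATGTT TCAAAAGAAA ATAGTACTGC TGCTCTGTTT"
print(translate_cds(first_line))  # MTDPVLSIDVSKENSTAALF
```

The output matches the first 20 residues of the protein sequence, and the last three codons of the gene (ACC TCT TGA) translate to the final residues TS followed by a stop.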