Gene Nther_1154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1154 
Symbol 
ID6315720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1220591 
End bp1222051 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content33% 
IMG OID642643527 
Producttransposase IS4 family protein 
Protein accessionYP_001917325 
Protein GI188585780 
COG category[L] Replication, recombination and repair 
COG ID[COG3666] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.543034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.461467 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGGTAG TAAATGACTC AATTGATGAA CTAGATGATA ATTTATTCTT AAAATATTAT 
CCAGGTGGAG GGAGAAGCAG TTATCATCCA AAAATGATGA CTAAAATTTT AGTATATGCT
TATACACAAA AGATATATAC TTCACGTCAA ATAGCCAAAG CAGTTAGAGA GCAACTGCCT
TTCATGTGGA TAGCAGCACG ACAAAAACCT GATTTTAGAA CAATTAATAG GTTTAGATCA
GAGCGGATGA AATACGTAAT AGACGAAGTG TTTGCATCAG TGCTTGAACT TTTGATAAAA
GAAGGTTACG TAAAATTTGA AAATTACTTT TTAGATGGCA CTAAAGTAGA AGCTAATGCA
AACCGTTACA GTTTTGTATG GAAGAAGTCC ACAGACCGCT ATGAAGCTAA TCTTCAAGCT
AAAATAAAAG AATTATTACA AGAGATAGAA GAGGAAAATG AGCGAGAAAA TGAAATATAT
GGTGATAAAG ATTTAGATGA ATTGGGAGAA GACAGTCAAA TAACAAGTGA AGATCTGGAG
AAAACCGTAG AAAAACTAGA ATCACGTTTA AAAGAAGAAC CAAAAAACAA AAAAGTGAAG
AAAGCAGTAA AAACAATAAA AAAAGACTAC TTACCCCGGA CACGAAAGTA TGAAAAATAT
CAGTCAACTT TTAATGGCCG AAACAGTTTT TCAAAAACAG ATAAAGATGC TACTTTTATG
CGCATGAAAG AAGATCACAT GAAAAATGGT CAGTTAAAAC CAGGATACAA CATTCAGCTG
GGAACAGAAA ACCAATTCAT TTTAGGATAT AGTATTCACC AGAAACCAAC TGACACAACT
TGTTTAATTC CGCATTTAGA AAAACTTGAA GAACAGCTTG GTAATGTTCC TCAAAACATA
ATTGCTGATG CTGGTTACGG AAGTGAAGAA AACTATAGAT ATTTAGAAGA AAAAGATAGG
AATGCTTACG TTAAGTATAA TACATATTTC CAGGAACAAA AACGTAGTTG GAGAAAGAAA
ATATTCCGAA GAGAAAATAT GCATTATGAT GCTAAAAATG ATAAATTCAT CTGTCCAAAT
GGGAAAGAGT TACACTTTCA ATATGAAAAA AGTTACAAAA CTGAAGCTGG TTATATTACA
AAACGGCGAC TGTATAGATG TTTTGATTGC CAAGAATGTG AGTTAAAAGA AAAGTGTACT
AAGTCTAAAA AAGGTAGGAC TGTTTGGGTC AATTGGGAAT TAGAAAGTTA CAAACAAAAA
GCTAGAGAAA ACCTTGGAAC TGATCATGGA AGAGAGCTCT CTTCTCAAAG GAAAATTGAT
GTGGAGAGTG TAAATGGTCA TTTCAAGGCT AATCGTATGT TTAGGCGATT TATGCTCCGT
GGGCTAGATA AAGTTAATAT CGAACTTGGA TTAATCAGTT TAGCACATAA TATGATTAAG
AAGGCAGCCA TAGGATTCTA G
 
Protein sequence
MRVVNDSIDE LDDNLFLKYY PGGGRSSYHP KMMTKILVYA YTQKIYTSRQ IAKAVREQLP 
FMWIAARQKP DFRTINRFRS ERMKYVIDEV FASVLELLIK EGYVKFENYF LDGTKVEANA
NRYSFVWKKS TDRYEANLQA KIKELLQEIE EENERENEIY GDKDLDELGE DSQITSEDLE
KTVEKLESRL KEEPKNKKVK KAVKTIKKDY LPRTRKYEKY QSTFNGRNSF SKTDKDATFM
RMKEDHMKNG QLKPGYNIQL GTENQFILGY SIHQKPTDTT CLIPHLEKLE EQLGNVPQNI
IADAGYGSEE NYRYLEEKDR NAYVKYNTYF QEQKRSWRKK IFRRENMHYD AKNDKFICPN
GKELHFQYEK SYKTEAGYIT KRRLYRCFDC QECELKEKCT KSKKGRTVWV NWELESYKQK
ARENLGTDHG RELSSQRKID VESVNGHFKA NRMFRRFMLR GLDKVNIELG LISLAHNMIK
KAAIGF