Gene Nther_1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1104 
Symbol 
ID6314987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1167269 
End bp1168882 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content41% 
IMG OID642643476 
ProductATP-dependent protease LonB 
Protein accessionYP_001917275 
Protein GI188585730 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR02902] ATP-dependent protease LonB 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.286767 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGAA TAGCCGGTCT CTTAGCAATG GTACAATTCT TCTTTGCAGT AGTTATCGGC 
CTTTATTTTT GGAATCAGTT AAGAAGCCAG CAGTCAAGTA AAGCTACTGT TGAGCGTGAA
TCGGAAAAAC AAATGGAAAA GCTCAGAAAA ATGAGAAGAA TATCCTTAGC AGAACCTTTA
GCAGAAAAAA CACGCCCCCA AAAATTTCAA GAAATTGTTG GTCAAAAGGA AGGGTTAAAA
GCACTGAGAG CAGCTCTTTG TGGACCTAAT CCACAACATG TGATAATTTA TGGTCCACCC
GGTATAGGAA AAACTGCAGC AGCTCGCGTA GTATTAGAGG AAGCTAAAAA CAATCCACGT
TCCCCTTTTG GTGAGCATTC TGAATTTGTG GAAATTGACT CCACTACTGC CAGGTTTGAT
GAAAGAGGGA TTGCTGATCC ATTAATCGGT ACTGTTCATG ACCCTATATA TCAAGGTGCT
GGAGCCATGG GAATACATGG TATCCCCCAG CCTAAACCTG GAGCAGTAAG TAAAGCCCAC
GGAGGAGTCT TATTTCTTGA TGAGATTGGA GAACTTCATC ATATACAGAT GAATAAACTG
TTAAAGGTCC TGGAAGACAG AAAAGTTTTT TTAGAAAGTT CATATTACAG TTCGGAGGAC
CCCAACTGTC CCCAGTATAT ACACGAGATT TTCCAAAAAG GCTTACCTGC AGATTTTCGA
CTTATCGGTG CCACCACTAA AGGACCAGAA AGTATTCCTG AAGCTATTCG AAGTCGTTGT
GTTGAAGTAT TTTTTAAGGG TCTAACTCCT GAAGAACTTG TTGGAATAGC GAAACATGCT
GTTAAAAGAC TGGGATTTGA GGTAGAACAA GGGGCCATTG AAATAATCAA AAAATATGCT
GAAAATGGTA GGGAAGCTGT TAACCTAATT CAAATAGCAG CAGGTCTTGC CCAGACAGAA
AAACGCTCCA ATTTAACCCA AAAAGACCTG GAATGGGTTA TTAACACAAG CCAAATTTCA
CCTAGACCAG ATAAACTGAT ACCCGCCAGA TCTCAAGTTG GGCTAGCCAA TGCATTAGCA
GTGCTAGGAC CTAGCCGAGG AACACTCTTA GAAATCGAAG TGTCTTCAAT CCCTTGTAAA
GAAGGTGAAG GTGAAATAAA CGTAACTGGA CTTGTGGAAG AAGAGGAAAT GGGGGGACAT
TCCCGAGTAG TTCGGCGCAA AAGTATGGCA AAAGGTTCAG TTGATAATGT GTTGACAGTA
CTAAAGCATT ATCTAGACGT TAATCCTAGG GATTACGACA TCCATATAAA TTTTCCAGGT
TCAATCCCAA TAGATGGTCC TTCTGCAGGA GTAGCCATAG CCACAGCAAT ATATTCATCT
TTATCAAATA AGCCGATTGA CCATAAAGTT GCCATGACAG GAGAAGTTTC AATTAGAGGA
GAAATAAAAC CTGTAGGAGG CGTTTCCACA AAAATAGAAG CAGCTAAAGA GGCTCAAGCC
AGTAAAGTTT TAATTCCTAA AGAAAACTAT CAAAAAACCT TTGAAGATGA AGAGTCTCTA
ACAGTGTCGC CAGTTGAGGA CTTAAAAGAA GTACTAAGAG AAGTCACCTT GTAG
 
Protein sequence
MEGIAGLLAM VQFFFAVVIG LYFWNQLRSQ QSSKATVERE SEKQMEKLRK MRRISLAEPL 
AEKTRPQKFQ EIVGQKEGLK ALRAALCGPN PQHVIIYGPP GIGKTAAARV VLEEAKNNPR
SPFGEHSEFV EIDSTTARFD ERGIADPLIG TVHDPIYQGA GAMGIHGIPQ PKPGAVSKAH
GGVLFLDEIG ELHHIQMNKL LKVLEDRKVF LESSYYSSED PNCPQYIHEI FQKGLPADFR
LIGATTKGPE SIPEAIRSRC VEVFFKGLTP EELVGIAKHA VKRLGFEVEQ GAIEIIKKYA
ENGREAVNLI QIAAGLAQTE KRSNLTQKDL EWVINTSQIS PRPDKLIPAR SQVGLANALA
VLGPSRGTLL EIEVSSIPCK EGEGEINVTG LVEEEEMGGH SRVVRRKSMA KGSVDNVLTV
LKHYLDVNPR DYDIHINFPG SIPIDGPSAG VAIATAIYSS LSNKPIDHKV AMTGEVSIRG
EIKPVGGVST KIEAAKEAQA SKVLIPKENY QKTFEDEESL TVSPVEDLKE VLREVTL