Gene Nther_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2037 
Symbol 
ID6317155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2152042 
End bp2154018 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content37% 
IMG OID642644425 
ProductExcinuclease ABC subunit B 
Protein accessionYP_001918192 
Protein GI188586647 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00000674762 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATGAAT TTAAATTACA ATCTGACTTT TCACTTGAAG GAGATCAACC CAAAGCTGTT 
GATGAACTGT GTGAAAGTTT AAATGGAGGT AATTCTCATC AAACACTGTT GGGAGTTACT
GGTTCAGGTA AAACCTTTAC AATGGCTAAT GTTATTCAAA GATTACAGCG TCCCACCCTT
GTTATTGCTC ATAATAAAAC ACTGGCAGCC CAATTATGTG GTGAGTTTAA AGAATTTTTC
CCCGAAAATG CAGTGGAATA TTTTGTGAGT TATTACGATT ACTATCAACC TGAAGCTTAT
ATACCTCAAA CAGATACTTA TATTGAGAAA GATGCTTCAA TTAATGATGA AATTGATAAA
TTACGCCATT CGGCAACAAG TGCTTTGTTT GAGAGAAGAG ATGTGATAAT TGTAGCTAGT
GTTTCCTGTA TATATGGTTT GGGGTCACCG GAAGAATACA GAGAACAGGT GTTATCTTTA
AGATGTGGAA TGGAAAAGGA CCGCGATGAA ATTCTAAAAG GATTAGTGGA CATCCAATAT
TCAAGAAATG ATGTTAACTT TACCCGCGGG ACCTTTCGAG TAAGAGGCGA TGTAATTGAA
GTTTTCCCAG CTTCTTATAC TGAAACAGCT GTTAGGATAG AACTGTTTGG TGATGAAATT
GAAAGGATAA CTGAAATAGA TACTTTGACA GGGGAAATAC TGGGAGAAAG AAATCATGTG
GCTATTTTTC CTGCATCCCA CTTTGTTACC CGTCGAAGCA AATTAGAAAA AGCCATTGAA
AGCATTCAGG AAGAGCTTCA TGAACAACTG GAATACTTAA AAAGACAAGG TAAAGCCGTA
GAAGCTAAAC GTTTAGAACA ACGAACCAAC TACGACTTGG AAATGCTACA AGAGATGGGT
TTTTGTCAAG GAATTGAGAA CTATTCTAGA CATTTGATCG GAAGACCTGC AGGAAGTAGA
CCTTATTGCT TAATTGACTA CTTTCCAGAT GATTATTTAA TGGTAGTAGA TGAATCTCAT
ATGACTATCC CTCAAATCAG GGGTATGTAT GCAGGGGATA TGTCCCGGAA ACAAAATCTT
GTAGACCATG GGTTTCGGCT TCCATCAGCC CTTGACAATA GGCCGTTGAA ATTTCAGGAA
TTTGAAAAAA TGATCAATCA AAATATTTAC GTTTCTGCTA CACCAGGACC ATACGAGAAA
GAACATAGTG AAAGAATAGT GGAGCAGATA ATCCGACCTA CAGGTCTAGT TGACCCTGAA
ACAGAAGTTA GGCCTGTGAA AGGGCAAATA GATGATCTAT ATAGTGAAAT TAATAAGCGA
ACAGACCGTA ATGAGAGAGT TTTAGTGACA ACATTAACTA AAAAGATGGC TGAAGATTTA
ACTGATTATT TGCGTGAAAT GGGAATTAGA GTAAGATATA TGCATTCAGA AATTGATACT
TTGGAGAGAA TGGAAATAAT TCGTGATTTG AGACTTGGTA AATTCGATGT ACTTGTAGGT
ATTAACTTGC TAAGAGAAGG ACTCGATCTA CCAGAAGTAA GTCTAGTTGC CATTTTGGAT
GCAGACAAAG AAGGCTATTT AAGGGATGAA AGGTCTTTAA TCCAAACTAT GGGCCGGGCT
GCCCGAAATG TAAACGGGCG AGTAATTATG TATGGGGATG CCATTACAGA TTCCATGCGA
AGAGCCATTG ATGAAACTAA TCGTAGGAGA GAAAAGCAGA TTGAATTTAA TGCGCGCCAT
AATATCACAC CACAAACTGT TCAAAAGAAA GTACATGATG TAATTGAAGC TACTAGATCT
GCAGAGGACG AAACTGAAGC TGCTACACCA GAAAACATTC AAGAAATGAG TGCCAAAGAA
CGTAAAGAGT TAATCGCTAA ATTACAGGAA GAAATGAAAC AGGCAGCGAA GGAATTAGAA
TTTGAAAAAG CAGCGGAATT AAGAGATTTA ATCATGGAGT TAAAAACTGC TCAATAG
 
Protein sequence
MNEFKLQSDF SLEGDQPKAV DELCESLNGG NSHQTLLGVT GSGKTFTMAN VIQRLQRPTL 
VIAHNKTLAA QLCGEFKEFF PENAVEYFVS YYDYYQPEAY IPQTDTYIEK DASINDEIDK
LRHSATSALF ERRDVIIVAS VSCIYGLGSP EEYREQVLSL RCGMEKDRDE ILKGLVDIQY
SRNDVNFTRG TFRVRGDVIE VFPASYTETA VRIELFGDEI ERITEIDTLT GEILGERNHV
AIFPASHFVT RRSKLEKAIE SIQEELHEQL EYLKRQGKAV EAKRLEQRTN YDLEMLQEMG
FCQGIENYSR HLIGRPAGSR PYCLIDYFPD DYLMVVDESH MTIPQIRGMY AGDMSRKQNL
VDHGFRLPSA LDNRPLKFQE FEKMINQNIY VSATPGPYEK EHSERIVEQI IRPTGLVDPE
TEVRPVKGQI DDLYSEINKR TDRNERVLVT TLTKKMAEDL TDYLREMGIR VRYMHSEIDT
LERMEIIRDL RLGKFDVLVG INLLREGLDL PEVSLVAILD ADKEGYLRDE RSLIQTMGRA
ARNVNGRVIM YGDAITDSMR RAIDETNRRR EKQIEFNARH NITPQTVQKK VHDVIEATRS
AEDETEAATP ENIQEMSAKE RKELIAKLQE EMKQAAKELE FEKAAELRDL IMELKTAQ