Gene Nther_2036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2036 
Symbol 
ID6317154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2149106 
End bp2151940 
Gene Length2835 bp 
Protein Length944 aa 
Translation table11 
GC content39% 
IMG OID642644424 
ProductExcinuclease ABC subunit A 
Protein accessionYP_001918191 
Protein GI188586646 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0663505 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.000130956 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGGACA AATTAGTAAT AAAAGGTGCA CGTGAGCACA ATTTAAAAAA TGTAGATTTG 
GAAATTCCAC GAAATAAATT TGTAGTTATG ACAGGTTTAA GCGGTTCGGG GAAGAGTTCT
TTGGCTTTTG ATACTATATA TGCCGAAGGT CAGCGACGAT ATGTGGAATC TTTAAGCGCT
TACGCTCGAC AGTTTTTAGG ACAAATGGAC AAACCGGATG TTGATTATTT AGAAGGACTT
AGCCCGGCTA TTTCAATAGA TCAAAAAACA ACCAGCCAAA ACCCAAGATC GACAGTGGGA
ACGGTAACAG AGATTTATGA TTATCTCAGA TTGCTTTATG CTAGAGTAGG GCGTCCACAC
TGTCCTAACT GCCATAAACC AATTACCCAG CAAACTGTAG ATCAGATGGT GGACCAAATC
ATTACTCTTC CCGAAGGGAC AAAATTCCAG ATATTGGCTC CTATAGTTAG AGGGAGGAAA
GGTCAACACG AAAAAGTTTT GGAAGATGCT CGCAAAAGCG GATATGTTAG AGCAAGAATT
GATGGAGACA TTAAATTACT ATCAGAAGAA ATAAATCTTG AAAAAAATAA GCAGCATACA
ATTGAAATAG TGATTGACCG TTTAAAAATG AAATCAGGTA TTCAAAATAG ATTGGCCGAC
TCTTTAGAGA GTGCTTTAAG CATTGCCGAT GGTCTTGTTA ATATCCATGT TTTAGATGAA
GAACGCGAAC TATTATTTAG CCAAAAATAT GCTTGTATTG AATGCGGTTT TAGTTTCCCA
GAGTTAGCTC CTAGGATGTT CAGTTTTAAC AGTCCTTACG GTGCTTGTAC TCATTGTGAT
GGTCTTGGTG AAAAGAGAGA ATTTGATATA GATTTGATAG TGCCTGATAA AAGCATGTCA
ATAAATGAAG GTGCAATTTT GCCTTGGAGT AAAAATAAAG ATGGGTATTT CTTTAATTTG
TTGACGGCCG TCTGCCAAAG TTTTGGGATT GATATGGATA CTCCTTTTGA AGAGTTATCT
AAAGAAGAAC AGAATCTTCT CTTGTACGGT TCAGGAGACG AGAAGTTTTC ATTTAGTTTT
AGAACAGGTC GTGGTCGTTG GTATGAAGGA AGCAGAGTTT TTGAAGGTAT TATACCGAAT
TTATCTCGCC ATTATAAGAG TACAAATTCA GATAGATTTA GGGAAGAAAT AGAATCTTAT
ATGGCTTCGA TTCCTTGTAG TCACTGTAAC GGTAAAAGGT TGAGGTCCGA GAGTCTAGCC
GTTAAAGTTG GAGACATGAA TATTGGTCAG GTTACGGAAA TGACCGTCAA AGAAGCTTTA
GACTTTTTTA CCAATATTGA ATTAACTGAT AAAGAATGGA AAATTGCCCG CCTAATTCTT
AAAGAGATAA ATGAACGGCT AAAATTTCTT AAAGACGTGG GGCTTGAATA CTTGACCCTG
GAAAGGTCTG CTGGAACTTT AAGTGGTGGG GAAGCTCAAC GGATAAGATT GGCAACTCAG
ATTGGCTCAA GTTTAACCGG AGTTTTGTAT ATCCTTGATG AGCCTAGTAT TGGCCTACAT
CAGAGGGATA ATGAACGTTT GATAAGGACT TTAGAGAATC TAAGAGATAT TGGAAACACT
TTGATTGTAG TAGAGCATGA TGAGACTACC ATTAGAAGAG CAGATCATTT GGTAGATATT
GGCCCTGGGG CAGGTCGTGA TGGTGGACAT GTAGTAGCTC AGGGAACAAT AGATGATATT
TGTGACCAAA AAGACTCAAT CACAGGTAAA TTTTTACAAG GTGAAGAAGT GATCCCGACA
CCACAAAAGA GAAGGAGCTC AAACGGTAAG TCAATTGATA TTAAGGGAGC GGCAGCCCAT
AATCTGGATA ATGTTGATGT AGAAATCCCC ATGGGAATAC TTAATTGTGT AACTGGAGTT
TCCGGTTCGG GAAAAAGTAC TTTGATTCAT GAAGTCCTGT ATAAAGGTTT GGCTGCCAAA
CTTCACAAAG CAAGAAAGAA ACCAGGGTAT TTTAAAGAAA TAAAAGGAAT AGAAAATCTG
GACAAGGTGA TAGAGATTGA CCAATCTCCT ATTGGCAGAA CTCCTCGCTC TAATCCGGCT
ACTTACACAG GTGTTTTTGA TCATATCAGG GAAATCTTTA GCGAAACTCC TGAAGCCCGG
ATGAGAGGTT ATAAACCTGG TAGATTTAGT TTTAATGTGA GAGGTGGAAG ATGTGAAGCT
TGTAAGGGAG ACGGGATTAT AAAAATAGAG ATGCACTTTT TACCCGATGT TTATGTTCCC
TGTGAAGTCT GTCAAGGTAA AAGATATAAT AGAGAGACTC TTGAGGTCAG ATATAAAGGT
AAGACTATTG CTGACGTTTT AGACATGGAT GTCAATACAG CCCTTGAGTT CTTTGCTTCT
ATACCCAAGA TAAGAAGAAA ATTACAGACA GTTGCAGATG TGGGTTTGGG CTATATCAAA
CTGGGGCAAC CAGCTACCAC GCTTTCTGGT GGAGAAGCAC AGCGTGTGAA ATTGGCTTCT
GAACTTAGCC GTAGAAGCAA TGGACGTACC CTATACATCT TGGATGAACC AACTACAGGC
CTTCATATGG CCGATGTTAA AAAATTGTTG GGAGTTTTAC AAAGACTTGT GAATAATGGA
GATACGGTAA TTGTAATAGA GCATGATCTG GATGTGATTA AAACTGCAGA TCATATTATC
GATTTAGGTC CTGAAGGTGG TTCAGAAGGA GGGCGTATAA TTGCCCAGGG CACCCCTGAG
GAAGTTTCTC AAAATGAAAA ATCCTATACC GGACAGTTTT TAGAAGAGAT TCTTAAAAAG
CGGGAATCGG CTTAG
 
Protein sequence
MQDKLVIKGA REHNLKNVDL EIPRNKFVVM TGLSGSGKSS LAFDTIYAEG QRRYVESLSA 
YARQFLGQMD KPDVDYLEGL SPAISIDQKT TSQNPRSTVG TVTEIYDYLR LLYARVGRPH
CPNCHKPITQ QTVDQMVDQI ITLPEGTKFQ ILAPIVRGRK GQHEKVLEDA RKSGYVRARI
DGDIKLLSEE INLEKNKQHT IEIVIDRLKM KSGIQNRLAD SLESALSIAD GLVNIHVLDE
ERELLFSQKY ACIECGFSFP ELAPRMFSFN SPYGACTHCD GLGEKREFDI DLIVPDKSMS
INEGAILPWS KNKDGYFFNL LTAVCQSFGI DMDTPFEELS KEEQNLLLYG SGDEKFSFSF
RTGRGRWYEG SRVFEGIIPN LSRHYKSTNS DRFREEIESY MASIPCSHCN GKRLRSESLA
VKVGDMNIGQ VTEMTVKEAL DFFTNIELTD KEWKIARLIL KEINERLKFL KDVGLEYLTL
ERSAGTLSGG EAQRIRLATQ IGSSLTGVLY ILDEPSIGLH QRDNERLIRT LENLRDIGNT
LIVVEHDETT IRRADHLVDI GPGAGRDGGH VVAQGTIDDI CDQKDSITGK FLQGEEVIPT
PQKRRSSNGK SIDIKGAAAH NLDNVDVEIP MGILNCVTGV SGSGKSTLIH EVLYKGLAAK
LHKARKKPGY FKEIKGIENL DKVIEIDQSP IGRTPRSNPA TYTGVFDHIR EIFSETPEAR
MRGYKPGRFS FNVRGGRCEA CKGDGIIKIE MHFLPDVYVP CEVCQGKRYN RETLEVRYKG
KTIADVLDMD VNTALEFFAS IPKIRRKLQT VADVGLGYIK LGQPATTLSG GEAQRVKLAS
ELSRRSNGRT LYILDEPTTG LHMADVKKLL GVLQRLVNNG DTVIVIEHDL DVIKTADHII
DLGPEGGSEG GRIIAQGTPE EVSQNEKSYT GQFLEEILKK RESA