Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_2036 |
Symbol | |
ID | 6317154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2149106 |
End bp | 2151940 |
Gene Length | 2835 bp |
Protein Length | 944 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642644424 |
Product | Excinuclease ABC subunit A |
Protein accession | YP_001918191 |
Protein GI | 188586646 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0663505 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.000130956 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGGACA AATTAGTAAT AAAAGGTGCA CGTGAGCACA ATTTAAAAAA TGTAGATTTG GAAATTCCAC GAAATAAATT TGTAGTTATG ACAGGTTTAA GCGGTTCGGG GAAGAGTTCT TTGGCTTTTG ATACTATATA TGCCGAAGGT CAGCGACGAT ATGTGGAATC TTTAAGCGCT TACGCTCGAC AGTTTTTAGG ACAAATGGAC AAACCGGATG TTGATTATTT AGAAGGACTT AGCCCGGCTA TTTCAATAGA TCAAAAAACA ACCAGCCAAA ACCCAAGATC GACAGTGGGA ACGGTAACAG AGATTTATGA TTATCTCAGA TTGCTTTATG CTAGAGTAGG GCGTCCACAC TGTCCTAACT GCCATAAACC AATTACCCAG CAAACTGTAG ATCAGATGGT GGACCAAATC ATTACTCTTC CCGAAGGGAC AAAATTCCAG ATATTGGCTC CTATAGTTAG AGGGAGGAAA GGTCAACACG AAAAAGTTTT GGAAGATGCT CGCAAAAGCG GATATGTTAG AGCAAGAATT GATGGAGACA TTAAATTACT ATCAGAAGAA ATAAATCTTG AAAAAAATAA GCAGCATACA ATTGAAATAG TGATTGACCG TTTAAAAATG AAATCAGGTA TTCAAAATAG ATTGGCCGAC TCTTTAGAGA GTGCTTTAAG CATTGCCGAT GGTCTTGTTA ATATCCATGT TTTAGATGAA GAACGCGAAC TATTATTTAG CCAAAAATAT GCTTGTATTG AATGCGGTTT TAGTTTCCCA GAGTTAGCTC CTAGGATGTT CAGTTTTAAC AGTCCTTACG GTGCTTGTAC TCATTGTGAT GGTCTTGGTG AAAAGAGAGA ATTTGATATA GATTTGATAG TGCCTGATAA AAGCATGTCA ATAAATGAAG GTGCAATTTT GCCTTGGAGT AAAAATAAAG ATGGGTATTT CTTTAATTTG TTGACGGCCG TCTGCCAAAG TTTTGGGATT GATATGGATA CTCCTTTTGA AGAGTTATCT AAAGAAGAAC AGAATCTTCT CTTGTACGGT TCAGGAGACG AGAAGTTTTC ATTTAGTTTT AGAACAGGTC GTGGTCGTTG GTATGAAGGA AGCAGAGTTT TTGAAGGTAT TATACCGAAT TTATCTCGCC ATTATAAGAG TACAAATTCA GATAGATTTA GGGAAGAAAT AGAATCTTAT ATGGCTTCGA TTCCTTGTAG TCACTGTAAC GGTAAAAGGT TGAGGTCCGA GAGTCTAGCC GTTAAAGTTG GAGACATGAA TATTGGTCAG GTTACGGAAA TGACCGTCAA AGAAGCTTTA GACTTTTTTA CCAATATTGA ATTAACTGAT AAAGAATGGA AAATTGCCCG CCTAATTCTT AAAGAGATAA ATGAACGGCT AAAATTTCTT AAAGACGTGG GGCTTGAATA CTTGACCCTG GAAAGGTCTG CTGGAACTTT AAGTGGTGGG GAAGCTCAAC GGATAAGATT GGCAACTCAG ATTGGCTCAA GTTTAACCGG AGTTTTGTAT ATCCTTGATG AGCCTAGTAT TGGCCTACAT CAGAGGGATA ATGAACGTTT GATAAGGACT TTAGAGAATC TAAGAGATAT TGGAAACACT TTGATTGTAG TAGAGCATGA TGAGACTACC ATTAGAAGAG CAGATCATTT GGTAGATATT GGCCCTGGGG CAGGTCGTGA TGGTGGACAT GTAGTAGCTC AGGGAACAAT AGATGATATT TGTGACCAAA AAGACTCAAT CACAGGTAAA TTTTTACAAG GTGAAGAAGT GATCCCGACA CCACAAAAGA GAAGGAGCTC AAACGGTAAG TCAATTGATA TTAAGGGAGC GGCAGCCCAT AATCTGGATA ATGTTGATGT AGAAATCCCC ATGGGAATAC TTAATTGTGT AACTGGAGTT TCCGGTTCGG GAAAAAGTAC TTTGATTCAT GAAGTCCTGT ATAAAGGTTT GGCTGCCAAA CTTCACAAAG CAAGAAAGAA ACCAGGGTAT TTTAAAGAAA TAAAAGGAAT AGAAAATCTG GACAAGGTGA TAGAGATTGA CCAATCTCCT ATTGGCAGAA CTCCTCGCTC TAATCCGGCT ACTTACACAG GTGTTTTTGA TCATATCAGG GAAATCTTTA GCGAAACTCC TGAAGCCCGG ATGAGAGGTT ATAAACCTGG TAGATTTAGT TTTAATGTGA GAGGTGGAAG ATGTGAAGCT TGTAAGGGAG ACGGGATTAT AAAAATAGAG ATGCACTTTT TACCCGATGT TTATGTTCCC TGTGAAGTCT GTCAAGGTAA AAGATATAAT AGAGAGACTC TTGAGGTCAG ATATAAAGGT AAGACTATTG CTGACGTTTT AGACATGGAT GTCAATACAG CCCTTGAGTT CTTTGCTTCT ATACCCAAGA TAAGAAGAAA ATTACAGACA GTTGCAGATG TGGGTTTGGG CTATATCAAA CTGGGGCAAC CAGCTACCAC GCTTTCTGGT GGAGAAGCAC AGCGTGTGAA ATTGGCTTCT GAACTTAGCC GTAGAAGCAA TGGACGTACC CTATACATCT TGGATGAACC AACTACAGGC CTTCATATGG CCGATGTTAA AAAATTGTTG GGAGTTTTAC AAAGACTTGT GAATAATGGA GATACGGTAA TTGTAATAGA GCATGATCTG GATGTGATTA AAACTGCAGA TCATATTATC GATTTAGGTC CTGAAGGTGG TTCAGAAGGA GGGCGTATAA TTGCCCAGGG CACCCCTGAG GAAGTTTCTC AAAATGAAAA ATCCTATACC GGACAGTTTT TAGAAGAGAT TCTTAAAAAG CGGGAATCGG CTTAG
|
Protein sequence | MQDKLVIKGA REHNLKNVDL EIPRNKFVVM TGLSGSGKSS LAFDTIYAEG QRRYVESLSA YARQFLGQMD KPDVDYLEGL SPAISIDQKT TSQNPRSTVG TVTEIYDYLR LLYARVGRPH CPNCHKPITQ QTVDQMVDQI ITLPEGTKFQ ILAPIVRGRK GQHEKVLEDA RKSGYVRARI DGDIKLLSEE INLEKNKQHT IEIVIDRLKM KSGIQNRLAD SLESALSIAD GLVNIHVLDE ERELLFSQKY ACIECGFSFP ELAPRMFSFN SPYGACTHCD GLGEKREFDI DLIVPDKSMS INEGAILPWS KNKDGYFFNL LTAVCQSFGI DMDTPFEELS KEEQNLLLYG SGDEKFSFSF RTGRGRWYEG SRVFEGIIPN LSRHYKSTNS DRFREEIESY MASIPCSHCN GKRLRSESLA VKVGDMNIGQ VTEMTVKEAL DFFTNIELTD KEWKIARLIL KEINERLKFL KDVGLEYLTL ERSAGTLSGG EAQRIRLATQ IGSSLTGVLY ILDEPSIGLH QRDNERLIRT LENLRDIGNT LIVVEHDETT IRRADHLVDI GPGAGRDGGH VVAQGTIDDI CDQKDSITGK FLQGEEVIPT PQKRRSSNGK SIDIKGAAAH NLDNVDVEIP MGILNCVTGV SGSGKSTLIH EVLYKGLAAK LHKARKKPGY FKEIKGIENL DKVIEIDQSP IGRTPRSNPA TYTGVFDHIR EIFSETPEAR MRGYKPGRFS FNVRGGRCEA CKGDGIIKIE MHFLPDVYVP CEVCQGKRYN RETLEVRYKG KTIADVLDMD VNTALEFFAS IPKIRRKLQT VADVGLGYIK LGQPATTLSG GEAQRVKLAS ELSRRSNGRT LYILDEPTTG LHMADVKKLL GVLQRLVNNG DTVIVIEHDL DVIKTADHII DLGPEGGSEG GRIIAQGTPE EVSQNEKSYT GQFLEEILKK RESA
|
| |