Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2426 |
Symbol | |
ID | 7408050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2571075 |
End bp | 2572406 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643716789 |
Product | HNH endonuclease |
Protein accession | YP_002574267 |
Protein GI | 222530385 |
COG category | [V] Defense mechanisms |
COG ID | [COG1403] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTATTTG TACTGAACAG GGATAAAACA CCTTTAGCAC CCTGTCATGA AGCGGTTGCG AGAAAATTGC TTAAGCATGG AAAAGCAGTA ATCCATAGAA TCTACCCTTT CACTATAAGA CTCAAAGAGC AAAAAGATAC ATCAACGTTC AAGCCAAACT ACAGACTTAA AATTGACTAT GGTAGCAGGT GCACAGGAAT TGTCATATTG AAGAACAATT GCGAAGTTGT ATTCATGATG AAGTTATATC ACAGAACAGA GATAAAAGAG AACATGGATA GAAGACGCAG TTTGAGACGT AGTAGGAGAA ATAGAAAGAC AAGGTACAGA AAAGCAAGAT TTTCGAACAG GAGAAGGGAT GAGAATTGGT TACCGCCTAC ACTACTGAGC AGGGTAAGAA ACATTGAAAC GTGGGTAAAA AGACTTTGCA AACTGTGTCC TGTTACTGCT ATTTCATATG AGAACGTCAA ATTTGATACG CAGAAGCTAA GAAATCCAGA GATTTCTGGA ATAGAATACC AGCATGGTAC TCTTCAGGGT TATGAAGTAA AAGAATATTT GCTTGAGAAA TTCAACTGGA GATGCGTCTA CTGTGGTGCT ACAGGAGTAC CACTCGAAGT TGAACATGTC ATTCCAAAGT CAAGAGGTGG AACAGACAGA GTAGACAACC TTGTCATAGC CTGCCATGGG TGTAACCAGA AAAAAGGGAA TAAGACAGCG GAAGAGTTTG GGTATCCAGA GATTCAGAAA CTTGTAAAAG CGCCACTAAG GGATTGTGCA TTGGTTAACG CAACAAGATG GAGAATATAC GAAGTTTTAA AAAACACAGG TTTACCTGTG GAATGTGGAA GTGGAGCTTT GACAAAGATG AATAGAATCA AGTTAGGACT ACCAAAAGAC CATCATTTTG ATGCAATCTG TGTAGGGTAT TCTACACCAA ATAGAATTTG GCTCAAGACC AAAACTGTTT TGCACGTGAT AGCGAAAGGT AGAGGCACAA GACAAATTGC TATACTTGAC AGGTATGGTT TTCCACGTGG GCATAGAACT AGAAAGAAAT TTTTCTATGG CTTTCAAACT GGTGATATGG TAAAAGTAGT TGTTCCGAAA GGCAAGTACA AAGGCACATG GGTAGGGACA GTATCGTGTA GAAACAGCGG GTATTTTGAT ATAAAGGACA AAACCGGGAA AAGGATTGTT CAGAGTATTT CATATAAACA CTGCAAAATA ATTCAACGAT TTGATGGGTA CTGCTACGAA TTAGAACGAA TAAGAATCTC AGGCACATTT CCACTCCAAC CTGTAGAGGT TGGAGCCTCC ATGTGCCAGT AG
|
Protein sequence | MVFVLNRDKT PLAPCHEAVA RKLLKHGKAV IHRIYPFTIR LKEQKDTSTF KPNYRLKIDY GSRCTGIVIL KNNCEVVFMM KLYHRTEIKE NMDRRRSLRR SRRNRKTRYR KARFSNRRRD ENWLPPTLLS RVRNIETWVK RLCKLCPVTA ISYENVKFDT QKLRNPEISG IEYQHGTLQG YEVKEYLLEK FNWRCVYCGA TGVPLEVEHV IPKSRGGTDR VDNLVIACHG CNQKKGNKTA EEFGYPEIQK LVKAPLRDCA LVNATRWRIY EVLKNTGLPV ECGSGALTKM NRIKLGLPKD HHFDAICVGY STPNRIWLKT KTVLHVIAKG RGTRQIAILD RYGFPRGHRT RKKFFYGFQT GDMVKVVVPK GKYKGTWVGT VSCRNSGYFD IKDKTGKRIV QSISYKHCKI IQRFDGYCYE LERIRISGTF PLQPVEVGAS MCQ
|
| |