Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0891 |
Symbol | |
ID | 7407466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 993999 |
End bp | 995228 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643715265 |
Product | transposase mutator type |
Protein accession | YP_002572774 |
Protein GI | 222528892 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3328] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA ACGAAATTCA CGAAACTGCT AAAAACATGG CTGTAGAGCA AGTATTAAAT ATGTATTGCT CCAAAGATGA TCCTAACCGC CCAGCTCTAA AACAACTCTT AGAAAACTTG CTCGATTGCT TTATGTTATC GGAAAGATCA GTGTACCTTG CTAAAAATGA CAATGACAAA GGCAATGGTT TTTACGATAG AAAACTTGCA ACACCTGTTG GCAGTCTTGA AATCTCTGTC CCTCGCACAC GTACTGGTAA TTTCCGACCT TCTATCCTCC CTGACCGCTA CAAAAGAGTT GATAGTTCAT ACACTGACCT GCTTATGTCT TTAGTCGTCA ATGGTTATTC CGAAAGTTCC CTTGTCCAGA CTTTGAAAGC TTTGAATCTT CCATATTCCG AAAATGAAAT ACTAAAAATC AAAGAAGACC TTAAAAATGA GCTTCAGTTA TTCAAACAAA GAGAACTACC AACAAGTGCT TTTGCTCTCA TCATCGATGG TTATCATTGT GAAGTTAAGG ATAATTCTAA GGTTAAACAA GCTACTTGTT ATGTTGTCCT CGGTATCGAC TTAGAAGGTA AAAAAGACAT TTTCGGTGTC TACACTTTCT TCGGCAAAGA AAATAAGGCT GATTGGATGA AAGTATTTGA AGACTTAATT ACAAGAGGGC TAAAAGAGAT TCTAATTGTC ATAAGTGATG ACTTCCCAGG TATTATAGAT GCTGTCAAAC TTGCTTATCC TCTTGCTGAC CATCAACTGT GTTTTGTCCA CCTCCAACGT AATGTCAGAA AACATATGAC AAAAGAGGAT GCTTCAGCTT TTAACAAGAG CTTGGACAAA ATCAAAACCT TTTCTCCTGA TTTCGATGAA GCTGTATTGA AATTTAAAGA ACTTTGTGAT GAATACCTTG CAAAATATCC TCGATTTATT AAAGCAATAT CAGAAAAAGC AGAGTTTTAT CTTGCCCATA TGAAATACCC CGAGGAATTA AGAAAGCATA TCTATACCAC AAACGCCGTT GAAAGTGTAA ACAGCATGAT TGAAAAGATT AGAGTAAATT CAGGTGGATA CTTTCAGACT GCCAAAGTCT TAGAAATTAA TATTTACTTA CAGCGAGAGA ACTTACGCCG TACAAAATGG AAAAATGGAG TTCCCAGTAT TAGAAAATGC ATCAATAACA TAACCCAACT TTACAACTTG CGTTATAAAT TGGAAACACA AAATTCTTGA
|
Protein sequence | MNKNEIHETA KNMAVEQVLN MYCSKDDPNR PALKQLLENL LDCFMLSERS VYLAKNDNDK GNGFYDRKLA TPVGSLEISV PRTRTGNFRP SILPDRYKRV DSSYTDLLMS LVVNGYSESS LVQTLKALNL PYSENEILKI KEDLKNELQL FKQRELPTSA FALIIDGYHC EVKDNSKVKQ ATCYVVLGID LEGKKDIFGV YTFFGKENKA DWMKVFEDLI TRGLKEILIV ISDDFPGIID AVKLAYPLAD HQLCFVHLQR NVRKHMTKED ASAFNKSLDK IKTFSPDFDE AVLKFKELCD EYLAKYPRFI KAISEKAEFY LAHMKYPEEL RKHIYTTNAV ESVNSMIEKI RVNSGGYFQT AKVLEINIYL QRENLRRTKW KNGVPSIRKC INNITQLYNL RYKLETQNS
|
| |