Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0786 |
Symbol | |
ID | 7407973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 875644 |
End bp | 877182 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643715164 |
Product | transposase |
Protein accession | YP_002572674 |
Protein GI | 222528792 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01765] transposase, putative, N-terminal domain [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0816314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAACAG TTCAGGCGAA GTTAGTGTTC GATAGAGAGG AAGACAAAAA GGCAGTATTA GATCTTATGA GAAGATGGTC CTCTTGTATG AGGTATGCAT ACAAGAGACT ACTGGAAAGG CATAAAAGGA ATGAACTAAA AAGAGAGCTG CAAGGAATTT TTAATCTTAA TTCCCGATAC GTTGATGATG CAATAATGAA AGCAAACAGT GTTTTAAACT CATGCAAAGG AAGAGGAGAA AATCCTGAAA AGGTCATTTT TGGTGGTAGG CAACTTTTTG AAAAACTAAA GAGGCGGTAC ATAAACGGCA AGGTATATAG GAAACTTCAA CGAGAGTGGC AGGAGAAGAG GAAGGGGAAT CTGTACTCAA GAGGAGACAG GAGCAAAAAG GGGAATCTCA ATACAAGGAT TGAGATAGAC GGGAACTTCA CAAAACTCAG GATTAACGTA GGAAAAAGAG AGTACGTATA TGCGACGATA CAAGCTGGAT GGAAGATGAA AGGTAAGACA TACATGGATA GGAACCTACT GCTACAAGCA ATAAGCAGCT TTAGTGGACC TTATTCTGTA GAACTGAAAC TCAAAAACGG TGTAGTATAT GCCTACTTCA CCGTTGAAGA AGTTTTCCCC AAGCCTGCGA TAACGAGAGC AAATGGAGTT ATAGGGATAG ACACTAACGC ATATCCAAAG AATGTTGCAT GGGCAGAAAC AGATGAGTAC GGACAGTTTC TGGGATATGG CAGAATACCA CTTGAGAAGC TTGAGAGTGG AAGCTCAAGC AAGAGAGAGT ATTACAGGTG GCAGTATGCA CACATGATAG TACAAATGGC GAAAGAGAAG CAAAAATCGA TAGTGATTGA GAACCTTAGC ATACAGGACA GGGGCAGAAG AGGCGACTTT TCAGGTAGAA AATCAAGACG GATAAGGCAC TATTTTGGGT ACAGGTCACT TTTGGAGAAG GTAAAACTTC TGGCAAAGCG TGAAGGGATA GAGGTTATAG AAGTAGACCC GGCGTATACT TCTGTGATAG GGATGTTGAA GTATGCACCG CAGTATATGG TGAGCAAGGA TATTGCGGCA GCGTATGTAA TAGCGCGAAG AGGACTTGGC TTGAGAGAAA GGATACCGCA CAATTATATG CTGCTTCTTA GTAGGCTTGA TGTAAACAAC CTGGAAGAGC TAAAAGAGTA TGTAAGGGAG GTAGTCAAGA ACAAACATCT GAGGAAAAAA CAACTCAAAA CGATAGATAG AGCGATAAAG TTTTTACAAA GCTCTGGGAG TGAGCCAGGG AGGCTATCCG TGCCTCTGGA TGGAACAAGC GCGGGTAGTC GTGGCAAAAA ACACAATCCC TGGCAAGTTC TTAGGGTAGC GGTGGTAACG CCACTCTCCC CTGACAGAGT CCTGCGTGAT ATGTCTGTCT TGAAATCGCT TTTGATTTCA GGGCAAGTGG GGAAGACCTG TAAGGGCGTA AGTTCCTGTT TCTTGGGGCA GGGGCTATGG CTTTCCCAAA TACCGCCTGC TGGGGCTGGG AAAGCCTGA
|
Protein sequence | MVTVQAKLVF DREEDKKAVL DLMRRWSSCM RYAYKRLLER HKRNELKREL QGIFNLNSRY VDDAIMKANS VLNSCKGRGE NPEKVIFGGR QLFEKLKRRY INGKVYRKLQ REWQEKRKGN LYSRGDRSKK GNLNTRIEID GNFTKLRINV GKREYVYATI QAGWKMKGKT YMDRNLLLQA ISSFSGPYSV ELKLKNGVVY AYFTVEEVFP KPAITRANGV IGIDTNAYPK NVAWAETDEY GQFLGYGRIP LEKLESGSSS KREYYRWQYA HMIVQMAKEK QKSIVIENLS IQDRGRRGDF SGRKSRRIRH YFGYRSLLEK VKLLAKREGI EVIEVDPAYT SVIGMLKYAP QYMVSKDIAA AYVIARRGLG LRERIPHNYM LLLSRLDVNN LEELKEYVRE VVKNKHLRKK QLKTIDRAIK FLQSSGSEPG RLSVPLDGTS AGSRGKKHNP WQVLRVAVVT PLSPDRVLRD MSVLKSLLIS GQVGKTCKGV SSCFLGQGLW LSQIPPAGAG KA
|
| |