Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0076 |
Symbol | |
ID | 7407313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 92363 |
End bp | 93901 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643714486 |
Product | transposase |
Protein accession | YP_002572009 |
Protein GI | 222528127 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01765] transposase, putative, N-terminal domain [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAACAG TTCAGGCGAA GTTAGTGTTC GATAGAGAGG AAGACAAAAA GGCAGTATTA GATCTTATGA GAAGATGGTC CTCTTGTATG AGGTATGCAT ATAAGAGACT ACTGGAAAGG CATAAAAGGA ATGAACTCAA AAGAGAGCTG CAAGGAATTT TTAATCTTAA TTCCCGATAC GTTGATGATG CAATAATGAA AGCAAACAGT GTTTTAAACT CATGCAAAGA AAGAGGAGAA AATCCTGAAA AGGTCATTTT TGGTGGTAGG CAACTTTTTG AAAAACTAAA GAGGCGGCAC ATAAACGGCA AGGTATATAG GAAACTTCAA CGAGAGTGGC AGGAGAAAAG GAAGGGGAAT CTGTACTCAA GAGGAGACAG GAGCAAAAAG GGGAATCTCA ATACAAGGAT TGAGATAGAC GGGAACTTCA CAAAACTCAG GATTAACGTA GGAAAAAGAG AGTACGTATA TGCGACGATA CAAGCTGGAT GGAAGATGAA AGGTAAGACA TACATGGATA GGAACCTACT GCTACAAGCA ATAAGCAGCT CTAGGGGACC TTATTCTGTA GAACTGAAAC TCAAAAACGG TGTAGTATAT GCCTACTTCA CCGTTGAAGA AGTTTTCCCC AAGCCTGCGA TAACGAGAGC AAATGGAGTT ATAGGGATAG ACACTAACGC ATATCCAAAG AATGTTGCAT GGGTAGAAAC AGATGAGCAC GGACAGTTTC TGGGATATGG CAGAATACCA CTTGAGAAGC TTGAGAGTGG AAGCTCAAGC AAGAGAGAGT ATTACAGGTG GCAGTATGCA CACATGATAG TACAAATGGC GAAAGAGAAG CAAAAAGCGA TAGTGATTGA GAACCTTAGC ATACAGGACA GGGGCAGAAG AGGCGACTTT TCAGGTAGAA AATCAAGACG GATAAGGCAC TATTTTGGGT ACAGGTCACT TTTGGAGAAG GTAAAACTTC TGGCAAAGCG TGAAGGGATA GAGGTTATAG AAGTAGACCC GGCGTATACT TCTGTGATAG GGATGTTGAA GTATGCACCG CAGTATATGG TAAGCAAGGA TATTGCGGCA GCGTATGTAA TAGCGCGAAG AGGACTTGGC TTGAGAGAAA GGATACCGCA CAATTATATG CTGCTTCTTA GTAGGCTTGA TGTAAACAAC CTGGAAGAGC TAAAAGAGTA TGTAAGGAAG GTAGTCAAGA ACAAACATCT GAGGAAAAAA CAACTCAAAA CGATAGATAG AGCGATAAAG TTTTTACAAA GCTCTGGGAG TGAGCCAGGG AGGCTATCCG TGCCTCTGGA TGGAACAAGC GCGGGTAGTC GTGGCAAAAA ACACAATCCC TGGCAAGTTC TTAGGGTAGC GGTGGTAACG CCACTCTCCC CTGACAGAGT CCTGCGTGAT ATGTCTGTCT TGAAATCGCT TTTGATTTCA GGGCAAGTGG GGAAGACCTG TAAGGGCGTA AGTTCCTGTT TCTTGGGGCA GGGGCTATGG CTTTCCCAAA TACCGCCTGC TGGGGCTGGG AAAGCCTGA
|
Protein sequence | MVTVQAKLVF DREEDKKAVL DLMRRWSSCM RYAYKRLLER HKRNELKREL QGIFNLNSRY VDDAIMKANS VLNSCKERGE NPEKVIFGGR QLFEKLKRRH INGKVYRKLQ REWQEKRKGN LYSRGDRSKK GNLNTRIEID GNFTKLRINV GKREYVYATI QAGWKMKGKT YMDRNLLLQA ISSSRGPYSV ELKLKNGVVY AYFTVEEVFP KPAITRANGV IGIDTNAYPK NVAWVETDEH GQFLGYGRIP LEKLESGSSS KREYYRWQYA HMIVQMAKEK QKAIVIENLS IQDRGRRGDF SGRKSRRIRH YFGYRSLLEK VKLLAKREGI EVIEVDPAYT SVIGMLKYAP QYMVSKDIAA AYVIARRGLG LRERIPHNYM LLLSRLDVNN LEELKEYVRK VVKNKHLRKK QLKTIDRAIK FLQSSGSEPG RLSVPLDGTS AGSRGKKHNP WQVLRVAVVT PLSPDRVLRD MSVLKSLLIS GQVGKTCKGV SSCFLGQGLW LSQIPPAGAG KA
|
| |