Gene Information |
Locus tag | Athe_1928 |
Symbol | |
ID | 7407341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2036435 |
End bp | 2037973 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643716300 |
Product | transposase |
Protein accession | YP_002573789 |
Protein GI | 222529907 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01765] transposase, putative, N-terminal domain; [TIGR01766] transposase, IS605 OrfB family, central region |
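The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2036435
END_BP = 2037973


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Length in aa of an unspliced CDS.

    Assumes the CDS is a whole number of codons; the final codon is
    treated as a stop codon when includes_stop is True.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the results agree with the Gene Length (1539 bp) and Protein Length (512 aa) fields.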
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00442004 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAACAG TTCAGGCGAA GTTAGTGTTC GATAGAGAGG AAGACAAAAA GGCAGTATTA GATCTTATGA GAAGATGGTC CTCTTGTATG AGGTATGCAT ACAAGAGGTT ACTGGAAGGG CATAAAAGGA ATGAACTCAA AAGAGAACTG CAAGGAATTT TTAATCTTAA TTCCCGATAC GTTGATGATG CAATAATGAA AGCAAACAGT GTTTTAAACT CATGCAAAGA AAGAGGAGAA AATCCTGAAA AGGTCATTTT TGGTGGCAGG CAACTTTTTG AAAAACTAAA GAGGCGGCAC ATAAACGGCA AGGTATATAG GAAACTTCAA CGAGAGTGGC AGGAGAAAAG GAAGGGGAAT CTGTACTCAA GAGGAGACAG GAGCAAAAAG GGGAATCTCA ATACAAGGAT TGAGATAGAC GGGAACTTCA CAAAACTCAG GATTAACGTA GGAAAAAGAG AGTACGTATA TGCGACGATA CAAGCTGGAT GGAAGATGAA AGGTAAGACA TACATGGATA GGAACCTACT GCTACAAGCA ATAAGCAGCT TTAGTGGACC TTATTCTGTA GAACTGAAAC TCAAAAACGG TGTAGTATAT GCCTACTTCA CCGTTGAAGA AGTTTTCCCC AAGCCTGCGA TAACGAGAGC AAATGGAGTT ATAGGGATAG ACACTAACGC ATATCCAAAG AATGTTGCAT GGGCAGAAAC AGATGAGTAC GGACAGTTTC TGGGATATGG CAGAATACCA CTTGAGAAGC TTGAGAGTGG AAGCTCAAGC AAGAGAGAGT ATTACAGGTG GCAGTATGCA CACATGATAG TACAAATGGC GAAAGAGAAG CAAAAAGCGA TAGTGATTGA GAACCTTAGC ATACAGGACA GGGGCAGAAG AGGCGACTTT TCAGGTAGAA AATCAAGACG GATAAGGCAC TATTTTGGAA GCAGATTACT TTTGGAGAAG GTAAAACTTC TGGCAAAACG GGAAGGAGTA GAGGTTATAG AAGTAGACCC GGCGTATACT TCTGTGATAG GGATGTTGAA GTATGCACCG CAGTATATGG TGAGCAAGGA TATTGCGGCA GCGTATGTAA TAGCGCGAAG AGGACTTGGT TTGAGAGAAA GGATACCGCA CAATTATATG CTGCTTCTTA GTAGGCTTGA TGTAAACAAC CTGGAAGAGC TAAAAGAGTA TGTAAGGAAG GTAGTCAAGA ACAAACATCT GAGGAAAAAA CAACTCAAAG CGATAGATAG AGCGATAAAG TTTTTACAAA GCTCTGGGAG TGAGCCAGGG AGGCTATCCG TGCCTCTGGA TGGAACAAGC GCGGGTAGTC GTGGCAAAAA ACACAATCCC TGGCAAGTTC TTAGGGTAGC GGTGGTAACG CCACTCTCCC CTGACAGAGT CCTGCGTGAT ATGTCTGTCT TGAAATCGCT TTTGATTTCA GGGCAAGTGG GGAAGACCTG TAAGGGCGTA AGTTCCTGCT TCTTGGGGCA GGGGCTATGG CTTTCCCAAA TACCGCCTGC TGGGGCTGGG AAAGCCTGA
|
Protein sequence | MVTVQAKLVF DREEDKKAVL DLMRRWSSCM RYAYKRLLEG HKRNELKREL QGIFNLNSRY VDDAIMKANS VLNSCKERGE NPEKVIFGGR QLFEKLKRRH INGKVYRKLQ REWQEKRKGN LYSRGDRSKK GNLNTRIEID GNFTKLRINV GKREYVYATI QAGWKMKGKT YMDRNLLLQA ISSFSGPYSV ELKLKNGVVY AYFTVEEVFP KPAITRANGV IGIDTNAYPK NVAWAETDEY GQFLGYGRIP LEKLESGSSS KREYYRWQYA HMIVQMAKEK QKAIVIENLS IQDRGRRGDF SGRKSRRIRH YFGSRLLLEK VKLLAKREGV EVIEVDPAYT SVIGMLKYAP QYMVSKDIAA AYVIARRGLG LRERIPHNYM LLLSRLDVNN LEELKEYVRK VVKNKHLRKK QLKAIDRAIK FLQSSGSEPG RLSVPLDGTS AGSRGKKHNP WQVLRVAVVT PLSPDRVLRD MSVLKSLLIS GQVGKTCKGV SSCFLGQGLW LSQIPPAGAG KA
|
| |