Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2200 |
Symbol | |
ID | 7408396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 2328672 |
End bp | 2329886 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716568 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_002574048 |
Protein GI | 222530166 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTTAA AACCTATTGC CGGGATTGAT GTCTCCAAAT ATTTCAGCGA AATGGTGGTT ATCTCTCCTG CAAATGAAAT ACTTGCACGC TTGACTATCC ATCACAATAA TCCCGCTGAC TTTGAGAGGG CTATTGAAAT CCTTAAAAAA GTTGAAGAGG ATTTCGCGGC TCGCCCTATC ATCGTCATGG AAGCCACAGG GCATTACCAC AAAATCCTCT CCCGCTTCTT TACTTGTCGG GGGTGGGATG TTTCTATTAT AAACCCCTAC AATCTAATTC TATCAAAAAT GCGGGAGAAA GTAAAAGTAA AAAATGATAA AAATGATGCC CTGTGGATTG CCTTAACCTT CAGGCTTACT AACTCTGCTA TAGCACAACC TTCATCTGAA ACCCTCGATT GCTTGAAAAA CCTATGCCGT CAGTACTACA ACCTCAGCGA TGAGTTGACC TCTTACAAAT ACAGACTTAC TTCTGTCGTC GACCAAATTA TGCTCAACTT CAAAGAGGTC TTCCCTGACA TTTGCTCTAA AACATCTTTG GCTATACTTG AAAACTACCC TACTCCAAAC GATATCCTAA GCGCTGACAG TGAAAAACTT ATTTCCATCA TTCAGCAAAC TTCTAAAAAA AGCTACCAAT GGGCTAAAGA AAAATATGAT CTACTCATCG CGAAAGCTAA AGAATTTAAG CCTTTTTCTA TCTCCAACTT GGCAAATGTT ACTATGCTTA AAGTCTACAT TAACATGGTC TTGACTTTGC AACAAAACAT CGACAAAATT TTTGAAGCCA TAAATCAGCT TGTTCAGCAG TCTTCACAGA CTCAACCTTC AATATCCGAA AATATTAACC TTCTTCAATC TATCCCCGGC ATAGGTTTTC TCACTGCTGC AACTATCCTT GCTGAAATAG GTGACTTCGA AAAATTTTCA AAACCTAATA AACTTGTTGC CTTCTTTGGC ATTGATCCTT CCGTAAATCA ATCCGGGCAA TTTGTCTCTA CATCAAACAA AATGTCTAAA CGTGGCTCTA AAATCTTGCG AAGAATCTTA TTTACAATTG CTCTTGCCAA TATCAGAACC AAAAGAGATT CTAAGCCTTG TAATCCTGTA CTATTCGAAT ACTATCAGAA AAAGTGCCAA CAAAAGCCCA AAAAAGTTGC TATTTTTCGC TGTTATGAGA AAGCTCATAT GCATTATCTT TGCTGTTATG CGTGA
|
Protein sequence | MNLKPIAGID VSKYFSEMVV ISPANEILAR LTIHHNNPAD FERAIEILKK VEEDFAARPI IVMEATGHYH KILSRFFTCR GWDVSIINPY NLILSKMREK VKVKNDKNDA LWIALTFRLT NSAIAQPSSE TLDCLKNLCR QYYNLSDELT SYKYRLTSVV DQIMLNFKEV FPDICSKTSL AILENYPTPN DILSADSEKL ISIIQQTSKK SYQWAKEKYD LLIAKAKEFK PFSISNLANV TMLKVYINMV LTLQQNIDKI FEAINQLVQQ SSQTQPSISE NINLLQSIPG IGFLTAATIL AEIGDFEKFS KPNKLVAFFG IDPSVNQSGQ FVSTSNKMSK RGSKILRRIL FTIALANIRT KRDSKPCNPV LFEYYQKKCQ QKPKKVAIFR CYEKAHMHYL CCYA
|
| |