Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2506 |
Symbol | |
ID | 7409375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2633071 |
End bp | 2634354 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643716869 |
Product | transposase |
Protein accession | YP_002574347 |
Protein GI | 222530465 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01765] transposase, putative, N-terminal domain [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000109626 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTAA CGTTATCCTG TAAATTTAAA CTCTGGCTGT CACAAGAACA AAAAAGAAAA CTCATCGAAA CTGCAAAAAC ATATACCAGT GCTATTAACT TTGTCTTAGC CGAAAATCTG AAAGACAAAA CAACTAACGT AAAGAAACTA CACAAACTCT ACTACAAAAC CATCAGAAAA AAGTTTTCTC TTCCCTCCCA GCTGGCTATC AATGTCTACC GACAAGTTGC AGACATATAC CAAACCCTGT GGGCACAATA TAACAACCTG CTGTACAGAG AGCAAAATAG CAACAACAAT AGTGGTACCG CTGAAGAATT CTGGAGTAAA CCACCAAAAC GAAAAACTCT CACAGTAAAC TACACGCACG GTCGCACCTT CTCCATCAAG TACGACAAAA ATACAGACAC CTTCTTCGTA TCCATTTCCT CCATCCATGG CAGAATTAAA AACGTTCGAA TCACCGGTTG GAAACAACAC TACAACTATT TAAAACACGG CGAAATAGGT GACCCTGTGC TTGTCTATGA TAAACCATCA AAAGAATTTT ACCTGCACAT CCCAGTAACC CTGGAAATCG ACGAAAAACT GCACAAAGAA ATTGCTGGTA TAGACGTTGG AGAGAGAAAT ATTGTAACAG TAGTGTCAAC TGCTGGTGCG AGATATACTA TACCACTTCC TGACCAGGTT AGACGTACCA AGCGTCACTA TCACGAGTTG CGCTCTCAGT TGATGTCAAA AGGCACTCGC TCTGCCAGAA GAAAACTCCA AAAGATTGGC ATGAGCGAGA AACGGTTCGT GTCCAACTTT CTACATAAAC TCACTAAGGA CCTTGTCAGG AAGCACCCGG CAGCACTATT TGTCATGGAA GATTTGAGCA TGATCAGAAC AAACAGGATA ACGTATCGTG GCAATGATAG TGAAGCGCGC CGCCAAGCAG AACAATGGCC TTTTGCCGAA CTACAAAACA AATTGGAGTA CAAATCAATA CTCTACAATG GAATATGTTC AGTCAAAGTT GACCCTTCGT ATACTTCGCT ATCCTGTCCT GTTTGTGGAC ATGTATCGAA AGACAACCGC CCCGGACATG GTGAACTATT TAAGTGTCAG CGCTGCGGTT ATGAAGAAAA TGCTGACATA GTAGGCGCAA CGAATATAGC AATAAGGTAT CTTGTGGAAG TTCAGCAGAT GAACCTGAGA GGGCTGCTTG TCAACCAGCC TAATGTTCCC TGTTTGCAAA AACAGGTAGA GCAAGCTCCT ACCTCTATAG GTAGGAGCAG TTGA
|
Protein sequence | MKLTLSCKFK LWLSQEQKRK LIETAKTYTS AINFVLAENL KDKTTNVKKL HKLYYKTIRK KFSLPSQLAI NVYRQVADIY QTLWAQYNNL LYREQNSNNN SGTAEEFWSK PPKRKTLTVN YTHGRTFSIK YDKNTDTFFV SISSIHGRIK NVRITGWKQH YNYLKHGEIG DPVLVYDKPS KEFYLHIPVT LEIDEKLHKE IAGIDVGERN IVTVVSTAGA RYTIPLPDQV RRTKRHYHEL RSQLMSKGTR SARRKLQKIG MSEKRFVSNF LHKLTKDLVR KHPAALFVME DLSMIRTNRI TYRGNDSEAR RQAEQWPFAE LQNKLEYKSI LYNGICSVKV DPSYTSLSCP VCGHVSKDNR PGHGELFKCQ RCGYEENADI VGATNIAIRY LVEVQQMNLR GLLVNQPNVP CLQKQVEQAP TSIGRSS
|
| |