Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2336 |
Symbol | |
ID | 3832054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2455582 |
End bp | 2456775 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637830260 |
Product | transposase IS66 |
Protein accession | YP_431166 |
Protein GI | 83591157 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACTG AGGTGCGGCA GGAGCTAAAA ATAATCCCGG CCCAGGTAAA GGTAGTTAAA CATATACGCT ACGTCTATGC CTGCCGCCAT TGCGAGCGGG AGGAGCTAAC CACTCCCGTT GTCACGGCGC CGATGCCGGC CCCCGTACTG CCGGGAAGCC CGGTATCCCC TTCCCTCCTG GCCTACGTCA TGCATCAGAA ATACGGGGAG GGCTTACCTC TCTACCGCCA GGAGCAGCAG TTTAAAAGCC TGGGACTTGA ACTCTCCCGT CAGACCCTGG CCAACTGGGT GCTCCACGGG GCGAACACCT GGTTAACGCA TATTTACGAC CGTCTTCATG AATACCTGCT TAAAAGAGAT ATCCTCCATG CCGACGAGAC GACCTTACAG GTCCTGAGAG AACCGGGAAG GGAAGCTGCC ACCAAGTCAT TCCTCTGGCT TTACCGTACC GGGCGGGATG GACCGTCAAT CGTCCTTTAC GACTACCAGA CCACCCGGGC CAGCAAACAC CCCTGCCGCT TCCTGGCGGG TTTTAAAGGC TACTTGCACG TCGACGGCTA CGCCGGCTAC AACGAACTGC CGGATGTCAC CCTGGTCGGC TGCTGGGCCC ATGCCCGGCG CAAGTTCGAC GAAGCCTTAA AAGCCCTGCC GGAAGATAAA CGTAATGCAG CGGTAGCCGC CCGGGAGGGA CTGGAGTTCT GTAACCGGCT CTTTACCATT GAACGCGACT TGAAAGATAA AACACCAGAG GAACGCTATC AACTCCGCCA GGTGCGCAGC AAACCCGTGC TGGACGCCTT TTTGGCGTGG CTAAAAACCC AGAAATCCCG GGTGCTGCCC AAAAGCTCCT TTGGGCAGGC GATTAACTAC TGCCTGGGCC AGTGGGATAA ACTCACCGCC TTTTTACAGG ATGGGCGTCT GGAACTCGAT AATAACCGCA GCGAGCGCTC CATCAAGCCT TTCGTCATCG GCCGCAAGAA CTGGTTATTT GCCAACACCC CGCGGGGTGC CAAAGCCAGC GCCATTACCT ACAGCATCAT AGAAACAGCT AAGGATAACG GGTTAAATCC CTTCCAATAC CTCATTTACC TCTTTGAAAG ACTTCCCAAC CTGGACCTCA AGGATAAAGA TGCCCTGGAT CAACTCCTGC CGTGGTCTGC TTCTTTGCCT CCTCTTTGCC GGATGAATAA TTAA
|
Protein sequence | MSTEVRQELK IIPAQVKVVK HIRYVYACRH CEREELTTPV VTAPMPAPVL PGSPVSPSLL AYVMHQKYGE GLPLYRQEQQ FKSLGLELSR QTLANWVLHG ANTWLTHIYD RLHEYLLKRD ILHADETTLQ VLREPGREAA TKSFLWLYRT GRDGPSIVLY DYQTTRASKH PCRFLAGFKG YLHVDGYAGY NELPDVTLVG CWAHARRKFD EALKALPEDK RNAAVAAREG LEFCNRLFTI ERDLKDKTPE ERYQLRQVRS KPVLDAFLAW LKTQKSRVLP KSSFGQAINY CLGQWDKLTA FLQDGRLELD NNRSERSIKP FVIGRKNWLF ANTPRGAKAS AITYSIIETA KDNGLNPFQY LIYLFERLPN LDLKDKDALD QLLPWSASLP PLCRMNN
|
| |