Gene Moth_2336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2336 
Symbol 
ID3832054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2455582 
End bp2456775 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content56% 
IMG OID637830260 
Producttransposase IS66 
Protein accessionYP_431166 
Protein GI83591157 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACTG AGGTGCGGCA GGAGCTAAAA ATAATCCCGG CCCAGGTAAA GGTAGTTAAA 
CATATACGCT ACGTCTATGC CTGCCGCCAT TGCGAGCGGG AGGAGCTAAC CACTCCCGTT
GTCACGGCGC CGATGCCGGC CCCCGTACTG CCGGGAAGCC CGGTATCCCC TTCCCTCCTG
GCCTACGTCA TGCATCAGAA ATACGGGGAG GGCTTACCTC TCTACCGCCA GGAGCAGCAG
TTTAAAAGCC TGGGACTTGA ACTCTCCCGT CAGACCCTGG CCAACTGGGT GCTCCACGGG
GCGAACACCT GGTTAACGCA TATTTACGAC CGTCTTCATG AATACCTGCT TAAAAGAGAT
ATCCTCCATG CCGACGAGAC GACCTTACAG GTCCTGAGAG AACCGGGAAG GGAAGCTGCC
ACCAAGTCAT TCCTCTGGCT TTACCGTACC GGGCGGGATG GACCGTCAAT CGTCCTTTAC
GACTACCAGA CCACCCGGGC CAGCAAACAC CCCTGCCGCT TCCTGGCGGG TTTTAAAGGC
TACTTGCACG TCGACGGCTA CGCCGGCTAC AACGAACTGC CGGATGTCAC CCTGGTCGGC
TGCTGGGCCC ATGCCCGGCG CAAGTTCGAC GAAGCCTTAA AAGCCCTGCC GGAAGATAAA
CGTAATGCAG CGGTAGCCGC CCGGGAGGGA CTGGAGTTCT GTAACCGGCT CTTTACCATT
GAACGCGACT TGAAAGATAA AACACCAGAG GAACGCTATC AACTCCGCCA GGTGCGCAGC
AAACCCGTGC TGGACGCCTT TTTGGCGTGG CTAAAAACCC AGAAATCCCG GGTGCTGCCC
AAAAGCTCCT TTGGGCAGGC GATTAACTAC TGCCTGGGCC AGTGGGATAA ACTCACCGCC
TTTTTACAGG ATGGGCGTCT GGAACTCGAT AATAACCGCA GCGAGCGCTC CATCAAGCCT
TTCGTCATCG GCCGCAAGAA CTGGTTATTT GCCAACACCC CGCGGGGTGC CAAAGCCAGC
GCCATTACCT ACAGCATCAT AGAAACAGCT AAGGATAACG GGTTAAATCC CTTCCAATAC
CTCATTTACC TCTTTGAAAG ACTTCCCAAC CTGGACCTCA AGGATAAAGA TGCCCTGGAT
CAACTCCTGC CGTGGTCTGC TTCTTTGCCT CCTCTTTGCC GGATGAATAA TTAA
 
Protein sequence
MSTEVRQELK IIPAQVKVVK HIRYVYACRH CEREELTTPV VTAPMPAPVL PGSPVSPSLL 
AYVMHQKYGE GLPLYRQEQQ FKSLGLELSR QTLANWVLHG ANTWLTHIYD RLHEYLLKRD
ILHADETTLQ VLREPGREAA TKSFLWLYRT GRDGPSIVLY DYQTTRASKH PCRFLAGFKG
YLHVDGYAGY NELPDVTLVG CWAHARRKFD EALKALPEDK RNAAVAAREG LEFCNRLFTI
ERDLKDKTPE ERYQLRQVRS KPVLDAFLAW LKTQKSRVLP KSSFGQAINY CLGQWDKLTA
FLQDGRLELD NNRSERSIKP FVIGRKNWLF ANTPRGAKAS AITYSIIETA KDNGLNPFQY
LIYLFERLPN LDLKDKDALD QLLPWSASLP PLCRMNN