Gene Moth_1724 details


Gene Information

Locus tag: Moth_1724
Symbol:
ID: 3833024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1769934
End bp: 1771142
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 52%
IMG OID: 637829649
Product: transposase
Protein accession: YP_430569
Protein GI: 83590560
COG category: [L] Replication, recombination and repair
COG ID: [COG5421] Transposase
TIGRFAM ID:
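
The coordinates and lengths listed in this record agree with each other, and a short check makes the arithmetic explicit. The sketch below assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the page does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1769934
end_bp = 1771142

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not counted
# in the protein length.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result matches the reported 1209 bp and 402 aa.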


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.000337952
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.570095
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGTAG CCATGGATAA TATGCGCTTC TTCCGGGCCG GTCCCGCAGC CCTTATCTCC 
AGGTTATGTG ACGTCCTGAA GATAGCAGAA ATCATAGATG CCGTAGTTGA CTGGGACCCG
GCCCAGTGCC ATCTTTCCCC GGGAAATCGG GTTAAGGCGC TGATCATTAA TCTCCTGGTA
GACCGGGAGG CCCTCTATCA TGTGGAGCGC TTTTATGAGA ACCAGGACCT GGAGGTTTTG
TTTGGAGCTG AGCAACAGGT CCGGCCTGAA GATTTTAACG ATGATGCTCT GGGCCGGGCC
CTGGATAAAC TCTTCACCAG CGGCCAGCTG AAGAAGTTGT TCTCCAGCAT TGCTTTAACT
GCCGCCGCCA CCCATAACGT ATCCATTGCG GGCATCCACG TCGATACCAC CTCCATTTCC
GTGCAAGGAG CCTATGATGG TGAAGGAGAT TTAGATATCA CTTTTGGTTT TAGTAAAGAT
CATCGCCCCG ACCTCAAACA GTTTCTCATC GGCTTGACCG TAAATAGAGA TGGGTTGCCC
ATTTTGGCTC AGAGCTTGGA CGGCAATAGC AGTGATAAGT CCTGGTACCC CCAGGTTATA
GAGGAATTGG TCCAAACCTT CAAGCCGGAA AAGCTTAAAG AGGTCATTTT CGTGGCGGAC
TGCGCCCTGG TAACTAAGGA TAACCTGGCT CTTTTGGTTC AGGAGGAAGG TAACAAACCC
GCCCTCCAGT TCATCTCCCT GTTACCGGAG AACTTCGGCC TTAACAAGGA GATTAAGGCT
GAGGCCTTCC GCACCGGCAC CTGGCAGGAG ATCGGGAAAC TAAGCCCCAA GAAAGATGCT
GCTTGCTATA AAAGCCAGAG CTTTGTCCGG GAAATAGACG GCCGCGATTA CCGGTTAATC
GTGGTCCACT CCACAACCCT GGATAAGCGC AAAGAGAATA GTCTCTTGAA AAAGTGGGCT
AAGCAAAGAG AAGTTCTGGA AAAGGCCGCC AAAGATCTTT CCCGCCGTCC CTTCGCCTGT
AAGGCCGACG CCAGGAAAGC CATAGAACTC TTCTTGAGGG AATACCGCCA CCAACCTTTC
ATCCTAAAGG GCACAGTTGA TGAAGAAATA GTGAGCAACT ACTATCAGGG GCCGGAGGTG
ATTAGAGCCC TTGAACTTGC CGGCTTCGGT AAGGAAATAT ATCTTTTTCC ACCTCGCGGT
GGCGGGTAG
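
The gene sequence is printed in blocks of ten bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how the whitespace might be stripped and a GC fraction computed; `clean_sequence` and `gc_content` are hypothetical helper names, not part of any database tool, and the database's exact rounding for its GC figure is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Example on the final line of the gene sequence only (9 bases).
last_line = "GGCGGGTAG"
print(len(clean_sequence(last_line)), gc_content(last_line))
```

Passing the full gene sequence to `gc_content` gives the value to compare against the GC content reported in the record.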
 
Protein sequence
MPVAMDNMRF FRAGPAALIS RLCDVLKIAE IIDAVVDWDP AQCHLSPGNR VKALIINLLV 
DREALYHVER FYENQDLEVL FGAEQQVRPE DFNDDALGRA LDKLFTSGQL KKLFSSIALT
AAATHNVSIA GIHVDTTSIS VQGAYDGEGD LDITFGFSKD HRPDLKQFLI GLTVNRDGLP
ILAQSLDGNS SDKSWYPQVI EELVQTFKPE KLKEVIFVAD CALVTKDNLA LLVQEEGNKP
ALQFISLLPE NFGLNKEIKA EAFRTGTWQE IGKLSPKKDA ACYKSQSFVR EIDGRDYRLI
VVHSTTLDKR KENSLLKKWA KQREVLEKAA KDLSRRPFAC KADARKAIEL FLREYRHQPF
ILKGTVDEEI VSNYYQGPEV IRALELAGFG KEIYLFPPRG GG
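
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below builds the codon-to-amino-acid mapping used by translation table 11, whose amino acid assignments are the same as the standard genetic code, and translates up to the first stop codon. The names `CODON_TABLE` and `translate` are illustrative, and alternative start codons are not handled.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments for table 11
# are identical to the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGCCTGTAG CCATGGATAA TATGCGCTTC"))  # MPVAMDNMRF
```

The first ten residues produced this way match the start of the protein sequence, and the final codons GGC GGG TAG give the terminal GG followed by a stop.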