Gene Moth_0718 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0718 
Symbol 
ID3830994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp748280 
End bp749584 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content60% 
IMG OID637828649 
Producthypothetical protein 
Protein accessionYP_429579 
Protein GI83589570 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000121294 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0965405 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAAG TAAGCTTGAC GGGGCTGATC CTATTTTTTA TGTTCCTGCC CTGGGGCACG 
GCCTGGGCCG GCCCCACCGC CGGCGAACTC CTGGCCCTGC TGCCGCCGGG AGCTGGGCAG
GCAACGGAAT TGACCCGGGG CGGTTTTGCG GTCATGCTGG CGGTTGCTGC CGGGATTAAG
GGCGATGCCG AAACCGGCGA GCTGCCCGTG GATGTGCCAC CGGAAAGCTG GTACACCCCG
GCGCTGCGGG CTCTGTGGCA ACAGGGGATT ATCCAGGGTT ATCCCAACGG CACCCTGCGG
CCGGAGCAAC CGATCACCTC CCTGGAAGCG GTAATCCTTA CCGCCAGGGC CATGGGATTA
CCCAACGAGA TCGGCGGGCG GGAAGATAAT CCCTTGCCTG GGGAGATACC TTATGGCCTT
AGCCAGTACG CCTTTTTTCA GCAGCAAGGG TTGTTGCCTC CCGGCGAACC CCTCGCACCC
ATGAGCCCGG CGGAAGCGGC CAGGTGGCTG GCGGCAGTCT TCGGTTCCGA AACCAGGGCG
GGAAACCTTT TGGCCCGGTG CCGCCAGGTC CTGGCAGGTA AAGAGGCCAT CCGCGTCAGA
GGGACAGTTA GCCTGCAGTT CTATAACCGG CCGGGGTTAC CAACTACTGC CGAATTGGAC
CGCATGGTGA TTTATGGCGA CGTTCTGAAT GAGGTTAGTA TGACAGGAAA GATGCACCAG
CTGGTGACCC TGCACCTGGA AAAAAAGCAG GAGATGACCA TCGAGCAATT TGTCTCAGGT
GGGTACCTCT ACCGCCGGGT GACAGGCAGT GGAAGGGAAA CCGGCGAATG GCAACGTCTA
TCTTTCGCCC CTGATGTTAG CCTTTTGTTG CGCCAGCAGC AGAACCTGGG CTTGCCGGCC
GGCATTTTTC CCTTCCTGCA TTACCACCTC CTGGGGGAAA GGGAGATTGC AGGCCGGCAC
GTGGTGGGGG TCAGCTTTTA TGCCCGGCAA AATAACCCCG GGGCCCCCGG CGACCTCCTG
CCGCTCCAGG TTTTCAGCGG CAGCGTGGAC GATTATTTCA GCCAGCCGGG TAAACTTATC
CGCTCCCTGT CTTACTGGGG CGTTATTTAC CTCGATTCGG AGAGCCTGCT GCCGGTAAAG
ATCGACCTTA ACCTGGTGAT GGCCTTTGAG CCTGCCCCCG GCGGGCAACC GGCAGTTATG
GCCGCCATGG AGGCCCGTTT TCAGGGGAAG GACTACAACT TTGACGATTT TAAAATAGAG
TTACCGGCGG CTGCGGTGGC GGCGCCAGTA AAAGAAAACC AGTAG
 
Protein sequence
MRKVSLTGLI LFFMFLPWGT AWAGPTAGEL LALLPPGAGQ ATELTRGGFA VMLAVAAGIK 
GDAETGELPV DVPPESWYTP ALRALWQQGI IQGYPNGTLR PEQPITSLEA VILTARAMGL
PNEIGGREDN PLPGEIPYGL SQYAFFQQQG LLPPGEPLAP MSPAEAARWL AAVFGSETRA
GNLLARCRQV LAGKEAIRVR GTVSLQFYNR PGLPTTAELD RMVIYGDVLN EVSMTGKMHQ
LVTLHLEKKQ EMTIEQFVSG GYLYRRVTGS GRETGEWQRL SFAPDVSLLL RQQQNLGLPA
GIFPFLHYHL LGEREIAGRH VVGVSFYARQ NNPGAPGDLL PLQVFSGSVD DYFSQPGKLI
RSLSYWGVIY LDSESLLPVK IDLNLVMAFE PAPGGQPAVM AAMEARFQGK DYNFDDFKIE
LPAAAVAAPV KENQ