Gene Moth_0381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0381 
Symbol 
ID3832625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp385892 
End bp387322 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content47% 
IMG OID637828318 
Productcarboxylyase-like protein 
Protein accessionYP_429258 
Protein GI83589249 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTATG CTGATCTGCG TCAATTTCTA GAGCGCTTGG AAGCTGAAGG ACAACTCGTC 
AGGGTAAAAG AACCGGTTCA GGCTGAACCA GACTTAAGCT CTATTGGAAG GGCAGCAGCT
AATTTAGAAC AGGCACCAGC AGTACTAGTG GAAAACATTA AAGGTTATTA CCAGAAAGTT
GCTTTGAACG TGCATGGTTC CTGGGCCAAC CATGCCCTCA TGTTCGAGCT ACCCAAAGAT
ATGCCGGTAA AAGAACAGTT TATGGAAATT AGCCGCCGAT GGGATAATTT TCCTGTTCCT
GTAAGACGGG TGTCCAGCTC CCCGGTTAAG GAGATAAAGC ATCACGAACA GATAAACCTG
TTTGAACTTA TGCCTCTATT CAGGGTAAAC AAGTTCGATG GGGGCTTCTA CCTCTCCAAA
GCGGCAGTGG TTAGCCGCGA TCCCGAGGAT CAAGATAACT TTCACAAGCA GAATGTTGGC
ATTTACCGTA TCCAGGTAAA AGGTAAAGAT CTTCTCGGAA TTCAAGTACT CCCCTTCCAC
GATATTGCTA TCCACTTACG CAAGGCAGAG GAGCGCAATC AGCCATTGCC AGTTGCTATT
GCTGTAGGCA ATGATCCGGT ACTGACTTTC GTTGCCAGCA CGCCAATGGC CTATGAAGAG
TCAGAATACG AAATGGCCGG GGCCTTGCGC GGTGAACCAT TTGAAGTTAT TAAGGCAGAA
ACGTCTAACC TTGATGTCCC GGCAGGAGCA GAACTTATCC TGGAAGGCGA AATTCTTCCC
CGCCGTCGTA GTCCGGAAGG ACCGTTTGGC GAATTTCCTG GAAGCTACTC GGGCGTTAGA
ATGCAAGCGG AAGTAAAGAT TCATACGGTT ACCCACCGGG TGGACCCGAT TTTCGAGAAC
CTGTATCTTG GTGTACCTTG GACGGAAATA GATTATCTTC AGGCACTGAA CACCAGCATT
CCTCTATATA AACAAATTAA AGCCTCAATG CCTGAGGTAG TAGCAGTAAA TGCTATGTAT
ACGCACGGTA TAGGGGTGAT TATTTCTACC AACTGCCGTT TTGGCGGGTA CGGTAAAGCA
GTAGCAATGC GGCTACTGTC CACCCCCCAC GGCATGCCCT ATTCCAAAAT TATCATTGTG
GTAGATGACT TTGTCGATCC CTTTAATTTA GAGCAGGTCA TGTGGGCATT GACAACCAAG
GTGCGCCCTG ATAAAGACGT AATACTGATT CATAATGCTC CTGGTATGCC GCTGGATCCT
TCATCCGATC CACCAGGAAT GCATACAAAA TTGATTATTG ATGCTACTAC TCCGGTACCG
CCAGACGTTG TGAGCAGGGA GGTGGAGTTA GTAGATACCC CTGCTAAGAC GGCTTTGTGG
GAAAAGATAC TTAAAGAACT CCATCGAAAT AAAAAAGGAG GGAGCATGTA A
 
Protein sequence
MPYADLRQFL ERLEAEGQLV RVKEPVQAEP DLSSIGRAAA NLEQAPAVLV ENIKGYYQKV 
ALNVHGSWAN HALMFELPKD MPVKEQFMEI SRRWDNFPVP VRRVSSSPVK EIKHHEQINL
FELMPLFRVN KFDGGFYLSK AAVVSRDPED QDNFHKQNVG IYRIQVKGKD LLGIQVLPFH
DIAIHLRKAE ERNQPLPVAI AVGNDPVLTF VASTPMAYEE SEYEMAGALR GEPFEVIKAE
TSNLDVPAGA ELILEGEILP RRRSPEGPFG EFPGSYSGVR MQAEVKIHTV THRVDPIFEN
LYLGVPWTEI DYLQALNTSI PLYKQIKASM PEVVAVNAMY THGIGVIIST NCRFGGYGKA
VAMRLLSTPH GMPYSKIIIV VDDFVDPFNL EQVMWALTTK VRPDKDVILI HNAPGMPLDP
SSDPPGMHTK LIIDATTPVP PDVVSREVEL VDTPAKTALW EKILKELHRN KKGGSM