Gene Moth_1317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1317 
Symbol 
ID3831804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1361406 
End bp1362578 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content50% 
IMG OID637829253 
Producthypothetical protein 
Protein accessionYP_430173 
Protein GI83590164 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.600188 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATA AGGAAAAGCT TTATCAAGAG AGATTTAATC GTTACGCTAC CGCCATGGAA 
TGCGGTAAGC CCGACAAGGT TCCGGTTGTT TTTCCACTGG GCGAATGGGT GGTTAGGTAC
ACAGATAGCA CTATTCAAGA AGCATATTAT GATTTCGATA AAGAAACAGA AATTATCTGC
AATATTATTA AAGATCTTGA TTTTGATATA ATGTATGGCT CCCTCCTATT ATGGTGGCCA
CCCATGTTTG ATGCCCTTGG CTCCAAATAC TATAAGTTCC CCGGCCTCAG ACTGGAAGCG
AACGTAGGTT ATCAGTATGT AGAAGAAGAG TACATGAAAC CCGAAGACTA TGACGATTTT
ATCGCCAATC CTACCGCATG GCTGGCTACT AAATTTTTAC CCCGTATCAG TGAGGAGTTT
GCCGAACCGG GGTCATACCG GGCTACGGTG GCATTAATTA AGAGCGCCGC GGGATTTGCC
ATGGCCAATG CCAACGCGGC TATAAAAGGT GAAAAATTGG CAAAAGAATA TGGTATAGTA
CCTTATGTGA CGGGAATGAC CAAAGCCCCG TTTGATACTC TGGGGGATGC CTTGCGGGGG
ATGAAAGGGA TCTTGCTGGA CCTTCGTCGG CGGCCCGACA AGGTTTTAGC CGCCTGCGAG
GCGATCGTAC CCCATAATAT CGCCTATGCC AGGATTACCG CAGCAGGCGA TACATCTCTG
CCTGTGTTTG CCCCCCTTCA TCGCGGGGCG TACCCGTTCC TGAGCATGGA ACAGTGGGAA
AAATTCTACT GGCCGTCCTG GAAAGCGGTT ATCGAGGGCC TCTGGGCGCA GGGGAAGAGG
ACCTTTTTCT TCGCCGAAGG AGACTGGACG CCGTACCTGG AGAAGATAGC CGAGCTGCCC
GAAAAAAGTA TCGTCTTTAT CATTGATAAT ACTGATGCCA AAAAAGCGAA GAAAATTCTC
GGCGGTAAGT TCTGCCTGTG GGGTGGCGTT CCCACTACGC TTCTGACTTA CGGTACGCCG
GCACAAGTGA AGGATTGTGT GAAGCAGGCT ATTGATGAAC TGGCCTGTGA CGGTGGCTTT
GTCCTTGCAC CGGGCGGGGT CGTTCTGGGT GATGCTAAGC GGGAAAACAT CCTGGCGATG
CTCGAAGCAG CAAGAGAATA CGGGGTTTAC TAA
 
Protein sequence
MSDKEKLYQE RFNRYATAME CGKPDKVPVV FPLGEWVVRY TDSTIQEAYY DFDKETEIIC 
NIIKDLDFDI MYGSLLLWWP PMFDALGSKY YKFPGLRLEA NVGYQYVEEE YMKPEDYDDF
IANPTAWLAT KFLPRISEEF AEPGSYRATV ALIKSAAGFA MANANAAIKG EKLAKEYGIV
PYVTGMTKAP FDTLGDALRG MKGILLDLRR RPDKVLAACE AIVPHNIAYA RITAAGDTSL
PVFAPLHRGA YPFLSMEQWE KFYWPSWKAV IEGLWAQGKR TFFFAEGDWT PYLEKIAELP
EKSIVFIIDN TDAKKAKKIL GGKFCLWGGV PTTLLTYGTP AQVKDCVKQA IDELACDGGF
VLAPGGVVLG DAKRENILAM LEAAREYGVY