Gene Moth_2111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2111 
Symbol 
ID3833262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2204497 
End bp2205516 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content63% 
IMG OID637830036 
Producturoporphyrinogen-III decarboxylase-like 
Protein accessionYP_430946 
Protein GI83590937 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCATG TGGAAAGGGT GCAGGTAACC CTGGCCCGGG GCCGGGCCGA CCGGGTACCC 
CGGGGGGAAT TCGCCATTGA GCCGGGGCTG GTTGCCGCCC TCCTGGGCCG GGAGGGCCCG
GTTGGCTTTA GCGAAGAGGC GGCCGCTAGG GAACTCCTGG GGATGGACCT CCTGGCCCTG
ACGCCAGGGG CACCCCTGGA GGAACAGGGG GACGGCAGTT ACCGGGATGT CTGGGGCTGC
CGTTACCAGA AGCGGGGCGG CCTGATGGTA CTCCAGGCCC CTGCTATCCA GGACATCCAG
TCCGCCGCCG GCTATACCCT GCCGGACCCC GGCACCTTTG ACCTTGAGGC CATCCGTCGC
TGGCGGGAGG AAACGGATTT TTTTGTTATG GCTTTTGTCG ACGGCCCCTT TCAGGGGACC
GGGAGGCTCT TTGATTTTAC CACCTTTCTC CTGGCGACGG CCGCCAGGGA AGAGGCCGTC
GAGGAACTGG CCGCGGCCGT CGTGGATTTC AACCTGGAAC TGGCCCGCCT CTGCCGCCAG
GCCGGGGCCC ACGCTATTAT CATCGGCGAC GATATCGCCT ACAGCCAGGG TACCTACATC
CGGCCGGACA TCTGGCGGGA GCTCTTTTTA CCCCTTTTAC GGCACCAGGT GGAGGGGATT
AAAGCCCTCG GCCTGCCGGT CATCTATCAC TCCGACGGTA ATCTGAAAGC CCTCTTGCCC
GACCTGGCGG CTCTCGCCCT GGATGGCCTT CAAGGCCTGG AACCGACTGC CGGCATGGAT
ATCGGCACCC TTAAAAAGGA ATACGGGGAA AAGTTGTGCC TCATGGGCAA CTTTGATCTG
GACCTCCTGG TTTCTGGCGA CCCGGAGACC ATAACTGCAG CGGCAGAGCG TTTGCTGGCC
GAAGCAGCCC CGGGCGGCGG GTATATCTTT TCTACCGCCT GCGGCATTTT AAACGCCTCC
CTGCCGCCGG AGAACGTCCG GGCCCTGTAC CGGGCCGTAG CCGATGGAGA CGTTTATTGA
 
Protein sequence
MNHVERVQVT LARGRADRVP RGEFAIEPGL VAALLGREGP VGFSEEAAAR ELLGMDLLAL 
TPGAPLEEQG DGSYRDVWGC RYQKRGGLMV LQAPAIQDIQ SAAGYTLPDP GTFDLEAIRR
WREETDFFVM AFVDGPFQGT GRLFDFTTFL LATAAREEAV EELAAAVVDF NLELARLCRQ
AGAHAIIIGD DIAYSQGTYI RPDIWRELFL PLLRHQVEGI KALGLPVIYH SDGNLKALLP
DLAALALDGL QGLEPTAGMD IGTLKKEYGE KLCLMGNFDL DLLVSGDPET ITAAAERLLA
EAAPGGGYIF STACGILNAS LPPENVRALY RAVADGDVY