Gene Moth_1246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1246 
Symbol 
ID3833041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1287089 
End bp1288063 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content61% 
IMG OID637829182 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_430103 
Protein GI83590094 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.402082 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGTT TTCCTATTTG CCGGCCGCGG CGCCTGCGGC AGAACGCAAC ACTGAGGGCC 
ATGGTCCGGG AGACGGAAAT TAACCCCCGC GACCTGATTT ATCCCCTCTT TGCCATCCAC
GGCCGGGGAG TAAAGAATCC CGTACCCTCC CTGCCCGGCG TTTACCAGCT TTCCATCGAT
AATCTGGTAC AGGAAGCCGG GGAAGTGGTG GCCGCCGGCA TCCCGGCAGT CCTTCTCTTC
GGCATCCCGG CTACCAAGGA CGAGGTCGGT TCCGGGGCTT ACGACCCCCA CGGCATCGTC
CAGGAGGCCG TGCGGGCCCT GAAAAAGGCC TACCCGGAAC TCCTGATCAT CACCGATGTC
TGCCTGTGCG AGTACACCAG CCACGGCCAC TGCGGCCTGG TGGACGACGG CCAGGTCCTC
AACGACCCTA CCCTGGAGTT AATAGCTAAA ACCGCCCTCT CCCACGTCGA AGCCGGGGCC
GATATCGTAG CGCCCTCGGA TATGATGGAC GGCCGGGTGG GGGCCATCCG CAAGCTCCTG
GACGCCAACG GTTTTACCCA GACCCCCATC CTGGCCTACT CGGCCAAGTA TGCCTCCGTT
TTTTACGGGC CCTTCCGGGA TGCTGCCGGT TCGACACCCC GGTTCGGCGA CCGCCGGGGC
TACCAGATGG ACCCTGCCAA CAGCGACGAG GCCCTGCGGG AAGTGGAACT CGACCTCCAG
GAGGGGGCCG ACATGGTCAT GGTGAAACCG GCCCTGCCTT ACCTGGATAT AATCCGCCGG
GTGAAAGATA ACTTTAACGT CCCCCTGGCG GCCTACCAGG TCAGTGGCGA GTACGCCATG
CTGAAAAGCG CCGCCGCCAA CGGCTGGCTG GATGAAGAGA AGAGCGTCCT CGAAGCCCTG
ACGGCCATCA AAAGGGCCGG TGCGGATCTT ATTATCACTT ATTACGCTAA AGATGTAGTC
AAGTGGCTTA AATGA
 
Protein sequence
MTGFPICRPR RLRQNATLRA MVRETEINPR DLIYPLFAIH GRGVKNPVPS LPGVYQLSID 
NLVQEAGEVV AAGIPAVLLF GIPATKDEVG SGAYDPHGIV QEAVRALKKA YPELLIITDV
CLCEYTSHGH CGLVDDGQVL NDPTLELIAK TALSHVEAGA DIVAPSDMMD GRVGAIRKLL
DANGFTQTPI LAYSAKYASV FYGPFRDAAG STPRFGDRRG YQMDPANSDE ALREVELDLQ
EGADMVMVKP ALPYLDIIRR VKDNFNVPLA AYQVSGEYAM LKSAAANGWL DEEKSVLEAL
TAIKRAGADL IITYYAKDVV KWLK