Gene Moth_1447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1447 
Symbol 
ID3832616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1489248 
End bp1490918 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content53% 
IMG OID637829380 
Productcobalamin B12-binding 
Protein accessionYP_430300 
Protein GI83590291 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase
[COG5012] Predicted cobalamin binding protein 
TIGRFAM ID[TIGR00640] methylmalonyl-CoA mutase C-terminal domain
[TIGR01463] methyltransferase, MtaA/CmuA family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0642463 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCTCA TTGACCTGGT GAATCAATCC GGGCGGCATC TGGTATTTTT CATTGGCAGT 
TTTCCTGGTG CAGCCATGAA AGGCGTTACA TTGAAAGAAG TATACTTTTC CGCTTCTCTC
CAGGCAGAAA CGGCCTGCTA CCTGGCGGAA CGTTTCCAGA TCGATTTTAT CCGTCCGGTT
ACTGACCTGG TGGTAGAGGC TGAGGCTATG GGCTTAAAGG CGCGTTATCC AGATAATGGC
GCCCCGGTGC TGGTAGAGCA TCCAATAACT GGACCAGAGG CGCTGGCCAA CTTAAACTTA
CCGGAACCCG GCCGTGACGG CCGTCTCCCG ATAAATCTTG AAGCCATTCG CTTGATAAAG
CAGCGGACCG ACAAACCGGT GATAGGTTCC TTGACAGGCC CTTTTACTTT AGCCGGTGCC
CTCTGCGATC CGGCTGGCGT GGCCATGAAA ACAATTACTG ACCCCGAGTT TCTGCATGCT
CTCCTGGCCT TTTGTACCCG GGCTTTGAAA CGATACGGGG AAGCCCTGCT GGCAGCGGGG
GCGGATATCC TCTGGATTTC TGAACCCCTG GGCTCTTTGC TGTCACCAGC ACAGTTCTGG
CAGTTTTCCG GCCGGTATAT CCAGGAGATT TTTGCGGCTT TCCCGGCCAT GGATATACTC
CATATCTGTG GCGATACCAG TTACATGATT AAAGAAATGC TGGCCACAGG CGCCCAGGGT
TTGAGCCTGG ACAGCAGGGT TAGTTTACCT GTCCTGGCAA CCCAAATGCC GGAGGATGTA
GTTCTAATCG GTAATATCGA TCCCGTCGGG GTGATGCTGG AAGGGTCACC CCAGCAGGTA
GTAAAAGCTA CTGCTAACCT GCTAACGGCT ATGCTGCCGT TCAACAACTT TATTGTGAGT
ACGGGATGCA CTTTGCCCTT TGAAGTCCCG GCAGCGAATA TTAGTGCTTT TGTGGAAACA
GCCAGGAACT TTCCCCGTTT ATCTCCCTCC CAGGCCCGGT TGTTGCTTTC TTTACGACGA
GCTTTGCTTG AAGGCGATAG TGAAGGAGTA ACAACGCTGA CCAGGAAAGG GTTGCAACTG
GAGGTGGATG CAATTACCAT TCTCGAGGGA GGCCTGATCC AGGGAATAAA CCGGGCCGGG
GAATTATACC AGCAACAGAA AGTATTTATC CCTGAGCTCT TGTTGATTAG CCAGGCCATG
TACGCCGGCC TGGAGGTTAT CAGGCCGGTA CTGGTGCAAA AGCGGCATGG CACCCGGGGC
GAGATTGTGC TGGGAACTGT TCAGGGAGAC CTGCACGATA TTGGTAAAAA TCTGGTTGGT
TTGATGTTAA CAGCTAATGA TTATGAGGTT ATCGACCTGG GTAAAGATGT AGCCCCGGAG
CAGTTTGTTG AAGCTGTACG GGCATGCCGG CCCCAGGTAG TGGGTCTGTC GGCTTTGACC
ACCACAACCA TGAAGTCCAT GGAAAAGACC GTGCGGGCGT TAAAACAAGG GGGTCAAGGG
GAGCAAGTTA AAGTTATAGT GGGAGGAGCC CCCATTACCC CCGAGTTTGC CCGGCGGATT
GGCGCTGATG CCTATGCTCC CGATGCTGCG GCGGCGGTTA CTGAGGTGGC TAATTTAATA
GATAAGGTGA AGGAGCATCA AGATGGCCAA TCAACCACTG GCGTATTTTG A
 
Protein sequence
MRLIDLVNQS GRHLVFFIGS FPGAAMKGVT LKEVYFSASL QAETACYLAE RFQIDFIRPV 
TDLVVEAEAM GLKARYPDNG APVLVEHPIT GPEALANLNL PEPGRDGRLP INLEAIRLIK
QRTDKPVIGS LTGPFTLAGA LCDPAGVAMK TITDPEFLHA LLAFCTRALK RYGEALLAAG
ADILWISEPL GSLLSPAQFW QFSGRYIQEI FAAFPAMDIL HICGDTSYMI KEMLATGAQG
LSLDSRVSLP VLATQMPEDV VLIGNIDPVG VMLEGSPQQV VKATANLLTA MLPFNNFIVS
TGCTLPFEVP AANISAFVET ARNFPRLSPS QARLLLSLRR ALLEGDSEGV TTLTRKGLQL
EVDAITILEG GLIQGINRAG ELYQQQKVFI PELLLISQAM YAGLEVIRPV LVQKRHGTRG
EIVLGTVQGD LHDIGKNLVG LMLTANDYEV IDLGKDVAPE QFVEAVRACR PQVVGLSALT
TTTMKSMEKT VRALKQGGQG EQVKVIVGGA PITPEFARRI GADAYAPDAA AAVTEVANLI
DKVKEHQDGQ STTGVF