Gene Moth_1100 details


Gene Information

Locus tag: Moth_1100
Symbol:
ID: 3833066
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1127451
End bp: 1128851
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 61%
IMG OID: 637829028
Product: cobyrinic acid a,c-diamide synthase
Protein accession: YP_429957
Protein GI: 83589948
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1797] Cobyrinic acid a,c-diamide synthase
TIGRFAM ID: [TIGR00379] cobyrinic acid a,c-diamide synthase
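
The length fields in this record can be cross-checked against its coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1127451
end_bp = 1128851
gene_length_bp = 1401
protein_length_aa = 466


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def coding_length_aa(gene_length):
    """Number of codons minus one, assuming a single terminal stop codon."""
    return gene_length // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_length_aa(gene_length_bp) == protein_length_aa
```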


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATACA AGCGGATTGT TATTGCCGGT ACCAGGAGCG GCGTCGGCAA GACAAGTATT 
GCCACCGGTT TAATGGCCGC CCTGGCCGCC AGGGGGTTAA AGGTCCAAGG GTTTAAGGTC
GGTCCCGACT ACATCGACCC CGGTTACCAC ACCCTGGCTA CGGGCAGGCC TTCCCGCAAT
CTGGATACCT ACTTAATGAC CCCGGCGGCC GTCCTGGAGG CCTTTGAGCG CGCCGCTGCC
ACCAGCGATA TAGCCGTTAT TGAAGGAGTT ATGGGGCTTT ATGACGGTCA CCGTGACACC
GGCAGCGGGA GTACGGCGAC CATTGCCCGC CTGCTGGCCG CCCCTGTCCT GCTGGTGGTG
GACGCTACTT CCCTGGGGCA GAGTGTGGCC GCCGAGGTCC TGGGTTACCG CTCCCTGGAC
CCGGGGGTAA ACCTGGCCGG GGTCATCCTG AACCGGGTCA GTAGTGAGGG TCACCTGGAG
GTGCTCCGCC AGGCTATAGA AGAATATACC GGCATACCGG TAGTTGGCTG GTTGCGACGG
GGCTCCCTGC CTCCCCTCCC TTCCCGGCAC CTGGGGTTGA TCCCGGCCGG GGAACAGGAG
GACCTGAGGC CCGTCCTGGC GGAACTGGCC GCTACCATAG CCACCGGCCT GGACCTGGAG
AGGGTACTCG ACCTGGCAAC ACAGGCCGGC CCCCTGCCGG CAGGGGGAAG CCGCCTTTTT
GCTACCGCCG GTGCTGGGGT GAGGGAGAAA ATCCCGGTTG CTGTGGCCCT GGATAAAGCC
TTTAACTTTT ACTACCAGGA TTCTCTAGAT TACCTGGCGG TTCTCGGAGC CGAATTGCTA
CCATTCAGCC CCCTAGAGGA CGACAGGTTG CCCGCTGGAG CCGCCGGGAT AATTATTGGC
GGTGGGTTCC CGGAGATTTT CCTCGCTCCC CTGACGGATA ACAGGCCCCT CCTCGCTGAC
CTCCGTCGGC AGGTCGCCCG GGGGATACCC CTTTATGCTG AGTGTGGCGG CCTCATGTAC
CTGGCCCGGG AGATCATTGA CCTGGAAGGC AGCAAGTGGC CCATGGCGGG CATTGTACCC
GCCGCCTGCC GTATGCAAAA GAGCCTGGCC GGCCTCGGTT ACAGGGAGGC CCGCCTCTGC
CGGGAAACCC TCGTGGGCCA CCGGGATGAT TGCCTTCGGG GGCATGAATT TCATTATTCC
ACCATGACAA GTAAGGATAC AGACTTCCCG CCGGCTTACA CCTGGAAACA CCGGGGGTCA
ATCTGGTACG ATGGCTACGG GACACGGCAG ATAGTAGCCT CTTATTTGCA CTTGCATTTC
CTGGGCAATG TAGGAGCCGC TCAAAATTTC CTGGCCGCCT GCCGGGCATA CAAAGGAGGA
AGAAACCTTG AAACTGCTTG A
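
A GC content figure such as the one reported in the Gene Information fields is the fraction of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted as text with the same space-separated 10-base blocks used above; `gc_fraction` is a hypothetical helper name, and only the first block of the gene is used here for illustration.

```python
def gc_fraction(seq):
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First 10-base block of the gene sequence above (illustration only).
first_block = "ATGACATACA"
print(gc_fraction(first_block))
```

Passing the full gene sequence to the same function gives the gene-wide value to compare with the reported percentage.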
 
Protein sequence
MTYKRIVIAG TRSGVGKTSI ATGLMAALAA RGLKVQGFKV GPDYIDPGYH TLATGRPSRN 
LDTYLMTPAA VLEAFERAAA TSDIAVIEGV MGLYDGHRDT GSGSTATIAR LLAAPVLLVV
DATSLGQSVA AEVLGYRSLD PGVNLAGVIL NRVSSEGHLE VLRQAIEEYT GIPVVGWLRR
GSLPPLPSRH LGLIPAGEQE DLRPVLAELA ATIATGLDLE RVLDLATQAG PLPAGGSRLF
ATAGAGVREK IPVAVALDKA FNFYYQDSLD YLAVLGAELL PFSPLEDDRL PAGAAGIIIG
GGFPEIFLAP LTDNRPLLAD LRRQVARGIP LYAECGGLMY LAREIIDLEG SKWPMAGIVP
AACRMQKSLA GLGYREARLC RETLVGHRDD CLRGHEFHYS TMTSKDTDFP PAYTWKHRGS
IWYDGYGTRQ IVASYLHLHF LGNVGAAQNF LAACRAYKGG RNLETA
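
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. Below is a minimal sketch of that translation, assuming the codon assignments of translation table 11 (which assigns the same amino acids as the standard code and differs in the start codons it permits). The helper names are illustrative, and the sketch does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence in this record.
head = "ATGACATACA AGCGGATTGT TATTGCCGGT"
tail = "AGAAACCTTG AAACTGCTTG A"
print(translate(head))  # MTYKRIVIAG
print(translate(tail))  # RNLETA
```

The two outputs match the first ten and last six residues of the protein sequence listed above; the final TGA codon is the stop and is not translated.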