Gene Moth_0110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0110 
Symbol 
ID3832000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp108857 
End bp110065 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content59% 
IMG OID637828044 
Productdihydropteroate synthase 
Protein accessionYP_428992 
Protein GI83588983 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0294] Dihydropteroate synthase and related enzymes 
TIGRFAM ID[TIGR01496] dihydropteroate synthase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTGGC ATGACGAGAT AGAAGTTATT ACCGTAGCCG ATGCCGCCAC TGCCGCCAGG 
TTGATGCAGG AAGTAGGTGC TGACGCGGGA GGGATCGACC TGATGCTTCC TAAGGCCTGC
TTCTACTGCC TCAAACTCAA GGATGTACCG GCCCGGGCGG CCAATATCCT CAAGCAGGAG
ATCCTGGCCC GGGGCGGCGA GGCCGCCATG GCTGCCGGGG TAGCCGGCTG GTCGGTAGAT
AAGACCGATG TGCTCATTTT CGCCACCCGG CGCCAGCTGG AACTCCTGGT GGCGAAACTT
CCCGTCCAGG GCTTCGGCTT AGATAAACTT GCGGCTGGTA TTAAAACTAC CCTGGCCAAC
TGGGAAAGGA ACGACGTCCG GAAAATAGGT TGTCCCCGGG GTCCTCTGGT TTTGGGTAAG
AGGACCCTGG TTATGGGCAT CGTCAATGTC ACCCCGGATT CCTTTTCCGA CGGTGGCCAA
TTCTATGACC CGGGTGCGGC GGTGGAACAC GCCCATCAGC TGGTCGCTGA AGGGGTTGAC
ATTATCGATG TCGGCGGGGA ATCCACCCGT CCCGGTCATG AACCGGTGAG TGCCGCAGAG
GAATGGCGGC GCCTGGAGCC CGTCCTGACA CGCCTGACCC GGGAAATCAA GGTGCCCATA
TCGGTAGATA CCTATAAAGC CAGTACTGCT CGCCGGGCAC TGGAGATGGG AGTTGACATA
ATCAATGATA TCTGGGGCTT CCGGGCTGAC CCGGAAATGG CCCGGGTTTG CGGCCAGTAC
CAGGCGGCAG TTGTCCTTAT GCATAACCAG GAGGGGACCA GCTACAAGGA CCTGATGTTT
GACATCCTTA CCTTCCTGCG CCAGAGCATC CGAATGGCTG AGGACAACGG CGTTCCCCCG
GAAAAAATTA TTGTCGATCC GGGTATCGGT TTTGGAAAGG ACCTGGACCA GAACCTGGAG
GTGATGGGGC GTCTGGAGGA GTTTAAGGTC CTGGGGAAAC CGGTCCTGCT GGGAACTTCC
CGTAAATCAA TGATCGGCAG GACCCTGGAC CTGCCCGTTG ATCAGCGCCT GGAGGGAACG
GCAGCCACCG TCGCCCTGGG TATTGCCCGG GGCGTGGATA TCGTTCGCGT CCACGACGTC
CGGGCCATGG TGCGGGTAGC CCGGATGACC GACGCTATAG TTCGCCGAAA AAAGGGCTCT
ACCCGATGA
 
Protein sequence
MSWHDEIEVI TVADAATAAR LMQEVGADAG GIDLMLPKAC FYCLKLKDVP ARAANILKQE 
ILARGGEAAM AAGVAGWSVD KTDVLIFATR RQLELLVAKL PVQGFGLDKL AAGIKTTLAN
WERNDVRKIG CPRGPLVLGK RTLVMGIVNV TPDSFSDGGQ FYDPGAAVEH AHQLVAEGVD
IIDVGGESTR PGHEPVSAAE EWRRLEPVLT RLTREIKVPI SVDTYKASTA RRALEMGVDI
INDIWGFRAD PEMARVCGQY QAAVVLMHNQ EGTSYKDLMF DILTFLRQSI RMAEDNGVPP
EKIIVDPGIG FGKDLDQNLE VMGRLEEFKV LGKPVLLGTS RKSMIGRTLD LPVDQRLEGT
AATVALGIAR GVDIVRVHDV RAMVRVARMT DAIVRRKKGS TR