Gene Moth_2423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2423 
Symbol 
ID3832174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2545855 
End bp2547276 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content60% 
IMG OID637830342 
Productradical SAM family protein 
Protein accessionYP_431248 
Protein GI83591239 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000017109 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCGCAA CCTTAAACCT GCAAATGGTC GACTGGCAGG GTGACCTGCA CTGGTTCGAC 
TTTGAAGACC TGACCCTACT CCTGGATGTC AACAGCGGTG CCGTACACCT CATTGACGCC
GCGGCCCGGG ATGTCCTGGA GGAGTTAAGG GACCGTCAGG AACAGGGCGT ACTGACGGCG
GAGGACTTGA TCACCGTCCT GCCCCGGGCC GTCCGGACCT GGGGAACGGC TACCGTCCTG
GAGGTATGCC GGGAACTGGC AGCCAGACAG GCCGCCGGTA CCCTTTTTAG CGTCGATGAA
GTCGGCAGGA ATTACCGGCC GCCGCAGCCT TCCCTCCAGG CCCTCTGCCT GCATGTTGCC
CACGATTGTA ACATGCGCTG CCGTTACTGC TTTGCCGACG GAGGTCCCTT CGGTGGCGAG
CGCGGTCTCA TGAACCGGGA TACCGGTTAT GCGGCCCTCG ATCTCCTCTT CCGGGAAGCA
GGCAACCGCC CGCGGGTAGA GGTAGACTTT TTCGGTGGGG AACCCCTTTT GAACTTCGGT
GTGGTCCGGG AACTGGTGGC TTACGGCCGG GAAAAGGCAG CCGCAGCCGG TAAGGGAATC
AGCTTCACTC TGACAACCAA TGGCCTGGCC TTAAGCCCGG AAATTGAAAA CTACTTGATC
ACTGAGGGCG TGAGCGTCAT CCTGAGCCTG GATGGTCGCC GGGAGGTCCA TGATTTCAAC
CGGCCTGACG CCGCCGGCCG GGGCACTTAT GAACGAGTTG TCCCCAGGGA GCAGCATTTT
GTCGCTAGCC AGGGGCACCG GGACTACTGG GTCCGGGGAA CCTACACTCG CCAGAATCTT
GATTTTACCA GTGATATCCT GCATATGGTA GAGTTAGGGT TTCGTTACCT GTCCATGGAG
CCGGTGGTAG CGGCCCCTGA GGCCGAATAT GCTATTAAAG AGGAAGATTT ACCCCGGCTG
GCAGGGGAGT ACCGGCGCCT GGCCCGGATT TACCTGGAGA GGGCGAGGGC AGGAAAAGGT
TTTTCCTTTT TTCATTTCAA TATCGATGCC GCTGCCGGGC CGTGCCTGAC AAAAAGACTT
ACCGGTTGTG GTGCCGGTAC CAGTTACCTG GCTGTAACCC CGGCAGGGGA CCTCTATCCC
TGTCACCAGC TCGTCGGGCG TAAGGATTAC TGCCTGGGTA ATGTCCGGGA GGGAATCCGG
CGTCCCGAGC TGCGGGAAGC CTTCCGCCAG GCTTATGTCT ATAACCAGCC GGCCTGCAGT
CGTTGCTGGG CACGCTTTTA TTGCAGTGGC GGCTGCCACG CGGCCAACCT GGCTGCTACC
GGCGACCTGC GCCAGCCGGC ACCAATCGCT TGCGCCATCC AAAAAATGCG CCTGGAAGCC
GCCCTTTACG TTCAGGTAAA AATGGCGGGA AAATTTTGTT GA
 
Protein sequence
MTATLNLQMV DWQGDLHWFD FEDLTLLLDV NSGAVHLIDA AARDVLEELR DRQEQGVLTA 
EDLITVLPRA VRTWGTATVL EVCRELAARQ AAGTLFSVDE VGRNYRPPQP SLQALCLHVA
HDCNMRCRYC FADGGPFGGE RGLMNRDTGY AALDLLFREA GNRPRVEVDF FGGEPLLNFG
VVRELVAYGR EKAAAAGKGI SFTLTTNGLA LSPEIENYLI TEGVSVILSL DGRREVHDFN
RPDAAGRGTY ERVVPREQHF VASQGHRDYW VRGTYTRQNL DFTSDILHMV ELGFRYLSME
PVVAAPEAEY AIKEEDLPRL AGEYRRLARI YLERARAGKG FSFFHFNIDA AAGPCLTKRL
TGCGAGTSYL AVTPAGDLYP CHQLVGRKDY CLGNVREGIR RPELREAFRQ AYVYNQPACS
RCWARFYCSG GCHAANLAAT GDLRQPAPIA CAIQKMRLEA ALYVQVKMAG KFC