Gene Moth_0231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0231 
Symbol 
ID3832559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp229959 
End bp231398 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content60% 
IMG OID637828167 
Productradical SAM family protein 
Protein accessionYP_429109 
Protein GI83589100 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.721774 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAATTGGG GACTAATAAA CCAAGCCAAA CCAGGGAAGG AGAGGATAGC CTTGATTAAC 
GCCCGTGCCC ATTTAGAGCT GGCCAAAAAG TATGTAACCG AAAAGGTCCT CCAGGAAGCC
TTCAGCTATA TGGAAAAAAA CCCAGAGGAG AATTTTCCCC GCATCCTGAA TACCGCCCGG
TTATTGGCCA GGGAGGAGGT ACATAAGCAA CAGATCGCCA AAGTGCTGGA GGCCTACCGG
ACCAACCCCA GCATCCACGC CTACGTGAAT CGCCTCTTCA AAGTGCATCC TAATGTTAAA
CAGCGCCTGA TCTACAACTG GTTCGTCAAC GCCATGCTCC TCGGTATACC TCGCCAGCAC
CAGGTCTCCC AGGAAACCGG GGTTCATATA CCTAATTTCT TCCTTCTGGA CCCCACCAGC
GACTGCAACC TGCGCTGTCA CGGCTGCTGG GCCGGGGAGT ATGCCCACCA CGACACCCTG
GAACTGGATC TGGTGGACCG CCTCTGCCGC GAGGCCAAGG CGGTCGGCAT TTACTGGCTG
GCCATGTCCG GCGGCGAGCC CTTCCGCTGG CCCCATCTCT TTGAACTGGC CGAGCGCCAT
CCCGATATGG CCTTTATGCT CTATACCAAC GGCACGCTCA TCGATGACGC CGTGGCCGAC
CGCATGGTGG AGGTCGGCAA CATCACGCCG GCCATTAGCC TGGAAGGCTG GCGGGAACGC
ACCGACGCCC GCCGGGGCCG GGGTGTCTTT GACCGGGTAA TGGCCGCCAT GGACCGCCTG
CGGGAGCGGG GTCTGGTCTT CGGGGTTTCC ATCACCATTA CCAGGGAAAA CGCGGAGGAG
GTCACCAGCG ATGAGTTCAT CGACTTCCTG TTGGAGAAGG GTGTAGTCTA CGGCTGGAGT
TTCCATTATA TACCCATCGG CCGGGATCCC AATCCCGAAC TCATGGTCAC TCCCGAGCAG
CGGGCCTACC TGGCTGAGCG CATTCCCTAT ATTCGTAACC ACAAGGGGCT GCAGATTGCC
GATTTCTGGA ATGACGGCGA GCTGACCCTG GGATGCATCG CCGGCGGCCG GCGCTACTTC
CACATCACCG CCAGCGGGGC AGTGGAGCCC TGCGCCTTCA TTCACTTCTC CATGGACAAC
ATCAAAGAGA AGAGCCTGCT GGAGGTTCTC CAGTCGCCCC TCTTCCGGGC CTATCAGCGC
CGCCAGCCGT TTAGCGATAA CCTGCTCAGG CCCTGCCCCC TCATCGATGT CCCTGAGGGC
CTGCGGCAGA TCGTAGCCGA AACCGGGGCT AAACCAACCC ACCCGGGCGC AGATACAGCC
CTGAAAGGTT CTATCGGCGC CTATCTGGAC GCCAACGCCG CCCGCTGGGG CGAGGTGGCT
GACAGGATCT GGCGGGAACG TCACCCGGAG CCCCAAAAAG AATTGACAGC CGGGAAGTAA
 
Protein sequence
MNWGLINQAK PGKERIALIN ARAHLELAKK YVTEKVLQEA FSYMEKNPEE NFPRILNTAR 
LLAREEVHKQ QIAKVLEAYR TNPSIHAYVN RLFKVHPNVK QRLIYNWFVN AMLLGIPRQH
QVSQETGVHI PNFFLLDPTS DCNLRCHGCW AGEYAHHDTL ELDLVDRLCR EAKAVGIYWL
AMSGGEPFRW PHLFELAERH PDMAFMLYTN GTLIDDAVAD RMVEVGNITP AISLEGWRER
TDARRGRGVF DRVMAAMDRL RERGLVFGVS ITITRENAEE VTSDEFIDFL LEKGVVYGWS
FHYIPIGRDP NPELMVTPEQ RAYLAERIPY IRNHKGLQIA DFWNDGELTL GCIAGGRRYF
HITASGAVEP CAFIHFSMDN IKEKSLLEVL QSPLFRAYQR RQPFSDNLLR PCPLIDVPEG
LRQIVAETGA KPTHPGADTA LKGSIGAYLD ANAARWGEVA DRIWRERHPE PQKELTAGK