Gene Moth_1990 details

Gene Information

Locus tag: Moth_1990
Symbol:
ID: 3832323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2071845
End bp: 2072996
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 62%
IMG OID: 637829919
Product: cystathionine beta-lyase
Protein accession: YP_430829
Protein GI: 83590820
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID:
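
The coordinate and length fields in this record are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2071845, 2072996)
print(length, protein_length(length))  # 1152 383
```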


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.916692
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAGAG GAACGCGACT GATCCATCAC CGCTTATCTA TGGATTCTGC TACCGGAGGG 
GTGAGTATCC CCATCCACCA GAGCGTGGTA TTTGCCCAGG AAAGCCTGGA TCAGCCGGGC
GAATACGAAT ACACCCGTTC CGGCAATCCT ACCCGGCGGG CCCTGGAAGA GGCCATCGCC
GAGCTGGAAG GAGGTAATTA CGGTTTTGCT TTTGCTTCCG GAATGGCAGC TATCACCGCC
GCTTTAAGCC TTTTTTCGTC CGGCGACCAC CTGCTGGTAT CCAGGGATAT CTATGGCGGC
ACCTACCGGG CCCTGGCCGA GGTTTTCCCG CGTTTCGGCC TGGAAGTAAC CTTCGTGGAT
ACCACCAACC TGGAGACAGT GGCGGCCCAG ATCCGGCCTT CTACCAAAGG GCTTTACCTC
GAAACCCCTT CCAACCCGCT GATGAAAATC ACCGACCTGG CCAGGGCCGC CGCCCTGGCC
AGGGAACACG GCTTGATAAC CATAGCAGAC AATACCTTCA TGACTCCCTA CCTGCAGCGG
CCCCTGGAAC TGGGAATTGA CATCGTCGTC CACAGCGCCA CCAAATACCT GGGCGGCCAC
AGCGACTGCC TGGCAGGCCT GGCTGTCACC AGGGACGCCG GCCTGGCCAG GGAACTGACC
CTGCTGCAAA ACACCCTGGG GACCGTCCTG GCCCCCCATG AGTGCTGGCT GATTTTACGG
GGCATCAAGA CTCTGAAGGT GCGCCTGCTC CAACAACAAC GGACGGCGAC TGTACTGGCG
GAATGGTTAC GCAAACACCC GCAAGTGAAG GCCGTCTACT ACCCGGGCCT GGAGGGGCAC
CCGGGCCGGG AAACGCACTT TCGCCAGGCC GACGGTGGCG GGGGCGTACT CTCCTTCCGC
CTGGCTACGC CGGAACTGGC CCGCCAGGTC ATTAACAACG TCAGACTGCC GGTCATTGGT
TCCAGCCTGG GGGCTGTGGA GAGCATCATC TCCCTACCGG CCACCATGTC CCACGGCAGC
CTGCCGGGAG AGCTAAAGCG CGAACTCGGG ATCACCCCCG ACCTGGTACG GCTGTCGGTG
GGTCTGGAGG AGGCGGAAGA CCTGCAGGCC GACCTGGAGC AGGCACTGGA TTCTCCCCGG
GGGCACAGGT AA
 
Protein sequence
MQRGTRLIHH RLSMDSATGG VSIPIHQSVV FAQESLDQPG EYEYTRSGNP TRRALEEAIA 
ELEGGNYGFA FASGMAAITA ALSLFSSGDH LLVSRDIYGG TYRALAEVFP RFGLEVTFVD
TTNLETVAAQ IRPSTKGLYL ETPSNPLMKI TDLARAAALA REHGLITIAD NTFMTPYLQR
PLELGIDIVV HSATKYLGGH SDCLAGLAVT RDAGLARELT LLQNTLGTVL APHECWLILR
GIKTLKVRLL QQQRTATVLA EWLRKHPQVK AVYYPGLEGH PGRETHFRQA DGGGGVLSFR
LATPELARQV INNVRLPVIG SSLGAVESII SLPATMSHGS LPGELKRELG ITPDLVRLSV
GLEEAEDLQA DLEQALDSPR GHR
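
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the tool that generated this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and builds the codon table from the compact amino-acid string in the usual T, C, A, G base order. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCAGAGAG GAACGCGACT GATCCATCAC"))  # MQRGTRLIHH
```

Passing the full gene sequence to `translate` allows a direct comparison against the listed protein sequence once the spaces are removed from both.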