Gene Moth_1989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1989 
Symbol 
ID3832322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2070631 
End bp2071767 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content56% 
IMG OID637829918 
Productcysteine synthase / cystathionine gamma-synthase 
Protein accessionYP_430828 
Protein GI83590819 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.485291 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTGG CTACAGAGCT GGTCCAGCTG GGAGTAGGGT ATGATAGTAA AACGGGAGCT 
ATCAGCACGC CTATCTACCA GTCAGCTACC TTCCGTCACC CGGCCCTGGG GCAGAGTACT
GGTTTTGACT ACAGCCGGAC AGGCAACCCT ACCCGCCAGG TCCTGGAAGA AGGCCTGGCC
GGGCTGGAGG GAGGCTGTCG CGCCCTGGCC TTTGCCTCCG GCATGGCCGC CATTACCGCC
GTTCTCTGCC TTTTCCGGCC CGGCGACCAC CTGGTGGTTT CTGAGGATTT ATACGGCGGT
ACTTACAGGC TGCTAAACCA AGTAGCGGTT CCCTTGGGGC TGGAGTTTTC CCTTGTAGAT
ACTACTGACC TGGCTGCCCT GGCTGCATCT ATAAGGAACA ATACGAAAGG CATCTTCCTG
GAGACACCTA CCAACCCACT AATGAAAATC ACCGATATTG CCGCCGTGGT TGCCCTGGCC
CGCCAGAGGG GCCTGTTGAC TATTGTAGAT AATACTTTTA TGACCCCTTA CCTGCAGCGA
CCCCTGGAAC TGGGAGCGGA CCTGGTGGTC CACAGCGCCA CCAAATATTT AGGCGGTCAC
AATGATGTAG TTATGGGGGC AGCGATAGCC GCCCGGGAGG ATCTCAGCGA AAGGCTGGCC
TTTATCCAAA ATACCATCGG CGCGATTCCC GGTCCCCAGG ACTGCTGGCT GGTAATCCGG
GGCTTGAAAA CCCTGGCCGT ACGCCTGGAG CGAGCCCAGG CCAACGCTTT TGAGCTGGCC
CGGTGGCTGG CCGAACACCC CCTGGTGACC AGGGTTTATT ATCCGGGCCT CCCCCATCAT
CCCGGTCACG AAATATGTAA AAAACAGTCC AGCGGGTTCG GGGCCATGCT TTCCTTTGAA
GTCAAGCACG CCGGACTGGT GGAGCAGATT TTACAGCGCT TAAAAATTAT TTCCTTTGCG
GAAAGCCTGG GTGGGGTAGA AAGCTTGATC ACTTTTCCGG AACGCCAGAC CCATGCCGAA
ATCCCTGCTG AGATGCGTCT TAAACTGGGC ATCAATGATC GTTTGTTACG TTTGTCAGTC
GGACTGGAAG ACTTGAACGA TCTCAAGGCC GACCTGGACC AGGCTCTGGC CTGTTAA
 
Protein sequence
MRLATELVQL GVGYDSKTGA ISTPIYQSAT FRHPALGQST GFDYSRTGNP TRQVLEEGLA 
GLEGGCRALA FASGMAAITA VLCLFRPGDH LVVSEDLYGG TYRLLNQVAV PLGLEFSLVD
TTDLAALAAS IRNNTKGIFL ETPTNPLMKI TDIAAVVALA RQRGLLTIVD NTFMTPYLQR
PLELGADLVV HSATKYLGGH NDVVMGAAIA AREDLSERLA FIQNTIGAIP GPQDCWLVIR
GLKTLAVRLE RAQANAFELA RWLAEHPLVT RVYYPGLPHH PGHEICKKQS SGFGAMLSFE
VKHAGLVEQI LQRLKIISFA ESLGGVESLI TFPERQTHAE IPAEMRLKLG INDRLLRLSV
GLEDLNDLKA DLDQALAC