Gene Moth_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1946 
Symbol 
ID3832296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2020555 
End bp2021496 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content67% 
IMG OID637829877 
ProductL-serine ammonia-lyase 
Protein accessionYP_430787 
Protein GI83590778 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1760] L-serine deaminase 
TIGRFAM ID[TIGR00718] L-serine dehydratase, iron-sulfur-dependent, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.426247 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000256832 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGACCCGGT ATAATTTTCA AAGCATGGCT GAACTCCTGC AGATAGCCGC GGATGAAGGG 
CTGACCCTGG CCGGGGTAGT CATCCGCTAC CAGGAAGACC TGGAGGGTAA GAGCCGCGAA
GAGGTGCGCC GGGCGATGGG GGAGAGGCTG GCCGTTATGC GGGCGGCTGC CAGGAAAGGG
TTGCATGAAG ACATCCGTTC CCCCAGCGGC CTGGTAGGCG GGGGCGGAAA ACTCCTGGAG
GAAAGGCGCC TGGCAGGGCA GAGCCTCTGT GCCGCCACCA CCGCCCGGGC CATTGCCCTG
GCCATGGCCG TAGCTGAGGT CAACGCTTCC ATGGGCCGGG TGGTAGCCGC GCCGACGGCT
GGCTCCTGCG GCATCCTTCC AGGGGTCCTG CTGGCCCTGG AAGCGGAAAA GGGGCTGGAC
GAAGACCTGC TTATCGATGG GCTCTTTGCG GCCGCCGGTA TCGGCATGGT GGCCGCCGGG
CAGGCCTCCC TTTCGGGGGC CGCCCTGGGG TGCCAGGCCG AGGTAGGGGT GGCCGCCGCC
ATGGCGGCAG CGGCGGCCGT GGAAATGACC GGAGGGGATG CGGTCCAGGC CGCCAACGCC
GCCGGGGTCG CCCTGCAGGG CCTGATGGGA CTGGTCTGCG ACCCCGTGGG TGGCCTGGTG
GAGGTCCCCT GCGTCATGCG CAACGCCATG GGCGCGGCCC AGGCCCTGGT GGCGGCCGAC
ATTGCCCTGG CCGGCGTCCA GTGCTATATA CCTTTTGATG AAATAGTCGC AGCCATGGTC
CAGGTCGGTC GCGCCCTGCC GCCGGAATTA CGGGAGACGG GTGCCGGCGG GATAGCCGCC
TGTCCCACCG CCCGGAAACT GGCCCGGCAG ATCGGGATCA AAACCCTGGA CAAGGATTCT
CTCCAGGAGA ATCTTTCGGT AGCAAGCCCT GGTATCCCTT AA
 
Protein sequence
MTRYNFQSMA ELLQIAADEG LTLAGVVIRY QEDLEGKSRE EVRRAMGERL AVMRAAARKG 
LHEDIRSPSG LVGGGGKLLE ERRLAGQSLC AATTARAIAL AMAVAEVNAS MGRVVAAPTA
GSCGILPGVL LALEAEKGLD EDLLIDGLFA AAGIGMVAAG QASLSGAALG CQAEVGVAAA
MAAAAAVEMT GGDAVQAANA AGVALQGLMG LVCDPVGGLV EVPCVMRNAM GAAQALVAAD
IALAGVQCYI PFDEIVAAMV QVGRALPPEL RETGAGGIAA CPTARKLARQ IGIKTLDKDS
LQENLSVASP GIP