Gene Moth_1942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1942 
Symbol 
ID3832434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2015713 
End bp2017170 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content62% 
IMG OID637829873 
Productglycine dehydrogenase subunit 2 
Protein accessionYP_430783 
Protein GI83590774 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000124327 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000296653 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGACGG AACCCTTACT ATTTGAACTC GGTGCCCCCG GAAGGCAAGG TTACACCCTG 
CCGGAATGTG ATGTACCGGG GAAAGTGGAG GATTACCTCC CCGAAGCAAC ACGCCGCCGG
TCCGATGCTG CCCTGCCGGA ATTGAGCGAA GTAGAGGTCG TCCGCCATTT TACCCATTTA
TCCACTATGA ACTATGGCGT CGATACCGGT TTCTACCCCT TAGGTTCCTG TACCATGAAG
TATAACCCCA AGATCAATGA AGCGACGGCC AACTTGCCAG GCTTTACGGG ACTGCACCCG
CTGGTGCCGG TTGAGGCGGC CCAGGGGGCC CTGGAACTCA TGTACAACCT GCAGGAGTAT
CTCGCGGAAA TCACCGGCAT GGATGCCATC ACCCTGCAGC CGGCCGCCGG CGCCCACGGA
GAGTATACCG GCCTGGCTGT CATCGCCGCC TATCATCAGA GCCGCGGTGA CCGGGAACGG
CGCCAGGTGC TGGTGCCGGA TTCCGCCCAT GGCACCAACC CGGCCAGTGC CGCCATGGCC
GGCCTGGAGG TAGTCCAGAT ACCCTCCGAC GAGGGGGGGC TAGTGGATCT TGAAGCCCTG
AAGGCTGCCG TCGGCCCCAG GACGGCGGCC CTGATGCTGA CCAACCCCAA CACCCTGGGC
CTCTTTGAGA GCAATATCGA GGCCATGGCA GCCATCGTCC ATGCAGCCGG CGGCCTCCTC
TATTATGACG GCGCCAACCT GAACGCCATC ATGGGCCTCA CCAGGCCGGG AGATATGGGC
TTTGACGTAG TTCACTTAAA CCTCCACAAG ACCTTCTCCA CCCCCCACGG CGGCGGTGGT
CCCGGCAGCG GCCCGGTGGG GGTGAAGGAG CACCTGGCTG CCTTCCTGCC GGTGCCGGTG
GTGGCTCGCC GGGAGGACGG CCAGTATTAC CTGGATTACG ACCGGCCCCA GAGCATCGGC
CAGGTACGTT CCTTCTATGG TAATTTCGGC GTCATGGTCA AGGCCTACAC CTATATCCGC
TCCCTGGGGG CCCCGGGCCT GAAGAGGGTC AGCCAACAGG CGGTTTTGAA TGCCAACTAC
ATGCTGGCGC GCCTCAGGCC CTACTTCAAG GTGCCCTTCG ACCGGCTGTG CAAGCACGAG
TTTGTCATCG CACCGTCCCA GGAGGTAACT GATGCCGGCG TTCATACCCT GGATATAGCC
AAACGCCTCC TGGACTACGG TTTCCATGCG CCTACCATCT ACTTCCCCCT CATTGTCCGC
GAGGCCATGA TGATCGAACC GACGGAAACG GAGCCCCGGG AGAACCTGGA CGCCTTCTGC
GACGCTTTGA TTGCTATTGC TAAAGAGGCA GTTGAGAACC CGGAGGCTCT GCACCAGGCG
CCCCATAACA CCCCAGTGCG GCGCCTGGAC GAGGTGGGCG CTGCCAGGAA CCCGGTTTTA
CGCTGGCGGG GCAGGTAG
 
Protein sequence
MKTEPLLFEL GAPGRQGYTL PECDVPGKVE DYLPEATRRR SDAALPELSE VEVVRHFTHL 
STMNYGVDTG FYPLGSCTMK YNPKINEATA NLPGFTGLHP LVPVEAAQGA LELMYNLQEY
LAEITGMDAI TLQPAAGAHG EYTGLAVIAA YHQSRGDRER RQVLVPDSAH GTNPASAAMA
GLEVVQIPSD EGGLVDLEAL KAAVGPRTAA LMLTNPNTLG LFESNIEAMA AIVHAAGGLL
YYDGANLNAI MGLTRPGDMG FDVVHLNLHK TFSTPHGGGG PGSGPVGVKE HLAAFLPVPV
VARREDGQYY LDYDRPQSIG QVRSFYGNFG VMVKAYTYIR SLGAPGLKRV SQQAVLNANY
MLARLRPYFK VPFDRLCKHE FVIAPSQEVT DAGVHTLDIA KRLLDYGFHA PTIYFPLIVR
EAMMIEPTET EPRENLDAFC DALIAIAKEA VENPEALHQA PHNTPVRRLD EVGAARNPVL
RWRGR