Gene Moth_1945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1945 
Symbol 
ID3832437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2018932 
End bp2020032 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content63% 
IMG OID637829876 
Productaminomethyltransferase 
Protein accessionYP_430786 
Protein GI83590777 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase) 
TIGRFAM ID[TIGR00528] glycine cleavage system T protein 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000285507 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCCGATT TAAAAAAGAC GCCCCTCTAC GGGGAGCACG TGGCTGCCGG GGCCAAAATG 
GTGGAATTCG GCGGCTGGTT GATGCCCGTC CAGTACAGCA GCATTATTGA GGAACATCAG
CGGGTGCGTA ACTGTGCCGG GCTCTTTGAC GTCTCCCATA TGGGGGAGAT TACCATAAAG
GGACCTGACG CCCTGGCGCT GGTCCAGAAG CTGCTTACCA ACGATGCCGA CCGGGCCACC
GGGGACAGGG TCATCTACAG CCCTATGTGT TACCCGGACG GGGGCGTAGT CGACGACCTG
CTGGTCTATC CCCGGGGAGA AGGGGAATAT CTCCTGGTAG TCAACGCCGG TAACATTGAC
AAGGACTTTG CCTGGATCCA GGAGAACGCT AGCGGTTTCC GGGTTGAGGT CAGCAATATC
TCCGCAGCTA CAGCTCAACT GGCCCTCCAG GGGCCACGAG CCCTGGAAAT TCTCCGGCCC
CTGACGAGGG TCGACCTGGC CTCCCTGGGT TATTACCGCT GGACCGAGGG CCAGGTTCTG
GGGGTTCATT GCCTGATCTC CCGCACCGGC TACACCGGCG AAGACGGTTT CGAGCTTTAC
TTTGAGGCGG CCGCAGCCCC TACCATGTGG CGGAATATCC TGGCCGCCGG CAGGGAGGCA
GGCCTGGTCC CGGCCGGACT GGGGGCCAGG GATACTCTAA GGCTGGAGGC GGCCCTGCCC
CTTTACGGCC ACGAGTTGGG CCCGGACATC AGCCCCCTGG AGGCCGGTTT GCACCGCTTT
GTCCGCCTGG AGAAGGGCGA ATTTAACGGG AGGGAGGCCC TGGCAGCCCA GCGGGAAGCC
GGGGTCAGGA GGCAACTGGT GGGACTGACC ATGATCGACC GGGGGATCCC CCGGCCGGAA
TACCCCGTCC TGGCGGCAGG CAAGGAGATT GGTTACGTTA CCTCAGGTTC CCTGGCGCCA
ACCCTGGGAC AAAATATCGC TCTGGCCCTG GTGGCGGCAG GAACTGTCTC TACCGGCGGC
GAAGTAGAAG TGAGCATCCG CGGCCGTGTC AACCGCGCCC GGGTGGTGAA ACTCCCCTTC
TATCGCCGCC CCAAAAAGTA A
 
Protein sequence
MADLKKTPLY GEHVAAGAKM VEFGGWLMPV QYSSIIEEHQ RVRNCAGLFD VSHMGEITIK 
GPDALALVQK LLTNDADRAT GDRVIYSPMC YPDGGVVDDL LVYPRGEGEY LLVVNAGNID
KDFAWIQENA SGFRVEVSNI SAATAQLALQ GPRALEILRP LTRVDLASLG YYRWTEGQVL
GVHCLISRTG YTGEDGFELY FEAAAAPTMW RNILAAGREA GLVPAGLGAR DTLRLEAALP
LYGHELGPDI SPLEAGLHRF VRLEKGEFNG REALAAQREA GVRRQLVGLT MIDRGIPRPE
YPVLAAGKEI GYVTSGSLAP TLGQNIALAL VAAGTVSTGG EVEVSIRGRV NRARVVKLPF
YRRPKK