Gene Moth_2052 details


Gene Information

Locus tag: Moth_2052
Symbol:
ID: 3831198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2143773
End bp: 2145068
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 60%
IMG OID: 637829981
Product: adenylosuccinate lyase
Protein accession: YP_430891
Protein GI: 83590882
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0015] Adenylosuccinate lyase
TIGRFAM ID: [TIGR00928] adenylosuccinate lyase
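
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are inclusive coordinates and that the final codon of the CDS is a stop codon that contributes no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2143773
end_bp = 2145068
gene_length_bp = 1296
protein_length_aa = 431

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases.
# Assumption: the last codon is a stop codon and adds no amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 1296 bp corresponds to 432 codons, of which 431 encode residues.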


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0154529
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.110523
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGAGC GCTATACCCT GCCGGAAATG GGTAAAATTT GGGAACCGGA GCACAAGTTC 
CGCACCTGGC TGGCCATTGA GATCTATGCC TGCGAGGCCT GGGCGGAGCT CGGCCGGATT
CCGCCGGCCG CCCTGGAAGA GATCAAGGCC AGGGCCGATT TCGACATTGA CCGGATCAAT
GAAATTGAAG CCACCACCCG CCATGACGTC CTGGCTTTCC TCACTGCCGT TGCGGAGAAG
GTCGGTGATG CCTCCAAGTA TATTCATCTT GGTATGACAT CCTCAGACGT CCTGGATACG
GCCCTGGCCG TACAGATGCG GGACGCCGCC GACCTGCTCC TTAAGCGCCT CCGGGATCTG
CGGGCCGAAC TGGTGAAAAA GGCCATGGAG CATAAATATA CTTTGATGAT CGGCCGCACC
CACGGCGTCC ACGCCGAGCC CACCACTTTT GGCCTCAAGA TGGCCCTGTG GGTTATGGAG
GTTGATCGGC ACCTCATCCG GCTGGAACAG GCTAAAGAGA TGATCAGCGT CGGCAAAATC
TCCGGCGCCG TGGGTACTTT CGCCAATATC AACCCCCGGG TGGAGGAGCA CGTCTGCCGT
CGCCTGGGGC TCAAGCCCGC CAGCGTCTCC ACCCAGATAA TCCAACGGGA CCGTCACGCC
CAGTTCCTGG CCACCCTGGC CATCATCGGC AGCTCCCTGG AGAAAATGGC CACAGAGATC
CGCAACCTCC AGCGCACCGA CATCCTGGAG GTGGAGGAGC CCTTCGCTAA AGGTCAAAAA
GGGTCTTCGG CTATGCCCCA CAAACGAAAC CCCATTATAT CGGAACGGGT GGCCGGCCTT
TCGCGAGTGT TGCGGGGCAA CGCCCTGGCC GCCATGGAGG ACATCGCCCT CTGGCACGAG
CGGGATCTCA CCCATTCCTC CGTTGAGCGG GTGATTATCC CCGACAGCAC CATCCTGCTG
GACTATATGC TGGTAAAGTT TACCGGGATT ATCGCCGGCC TTAACGTCTA CCCGGAGAAC
ATGCGCCGGA ATCTCGAGGC CACCCACGGC CTGGTCTTTT CCCAGCGGGT CCTCCTGGCC
CTGGTGAATA AGGGCGTCCT GCGGGAAACG GCCTACGCCT GGGTGCAGCG CAACGCCCTC
AAGGCCTGGC AGACCCGCCA GCCATTTAAA GAACTGGTCC TTAAGGACCA GGATATCATG
TCTCGCCTGG ATCCGAAGGA AGTAGAGGCC CTCTTTGATT ACGACTACCA CCTGCGCCAC
GTAGATTATA TCTTCCGCCG CGCCGGTTTG GAGTAG
 
Protein sequence
MIERYTLPEM GKIWEPEHKF RTWLAIEIYA CEAWAELGRI PPAALEEIKA RADFDIDRIN 
EIEATTRHDV LAFLTAVAEK VGDASKYIHL GMTSSDVLDT ALAVQMRDAA DLLLKRLRDL
RAELVKKAME HKYTLMIGRT HGVHAEPTTF GLKMALWVME VDRHLIRLEQ AKEMISVGKI
SGAVGTFANI NPRVEEHVCR RLGLKPASVS TQIIQRDRHA QFLATLAIIG SSLEKMATEI
RNLQRTDILE VEEPFAKGQK GSSAMPHKRN PIISERVAGL SRVLRGNALA AMEDIALWHE
RDLTHSSVER VIIPDSTILL DYMLVKFTGI IAGLNVYPEN MRRNLEATHG LVFSQRVLLA
LVNKGVLRET AYAWVQRNAL KAWQTRQPFK ELVLKDQDIM SRLDPKEVEA LFDYDYHLRH
VDYIFRRAGL E
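
The gene and protein sequences above are printed in blocks of ten characters. The sketch below, written in Python, shows one way to strip that spacing, compute a GC fraction, and translate the opening of the gene sequence for comparison with the opening of the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one; NCBI translation table 11 assigns the same amino acids to codons and differs only in which start codons it permits.

```python
# Standard codon assignments in TCAG order. Translation table 11 uses the same
# codon-to-amino-acid mapping; it differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence and first two blocks of the protein sequence,
# copied from this record.
gene_start = "ATGATCGAGC GCTATACCCT GCCGGAAATG GGTAAAATTT GGGAACCGGA GCACAAGTTC"
protein_start = "MIERYTLPEM GKIWEPEHKF"

print("translation matches:", translate(gene_start) == clean(protein_start))
print("GC fraction of first 60 bases:", round(gc_fraction(gene_start), 3))
```

Passing the full gene sequence to the same functions would allow the listed GC content and protein length to be checked in the same way.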