Gene Moth_0633 details


Gene Information

Locus tag: Moth_0633
Symbol:
ID: 3832531
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 657533
End bp: 659110
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 56%
IMG OID: 637828575
Product: phosphoenolpyruvate carboxykinase
Protein accession: YP_429505
Protein GI: 83589496
COG category: [C] Energy production and conversion
COG ID: [COG1866] Phosphoenolpyruvate carboxykinase (ATP)
TIGRFAM ID: [TIGR00224] phosphoenolpyruvate carboxykinase (ATP)
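
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic can confirm. The Python sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(657533, 659110)
print(length)                     # 1578
print(protein_length_aa(length))  # 525
```

Both values match the Gene Length and Protein Length fields of this record.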


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000533993
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAATA CTTATGGTTT AGAGCGATTG GGCATCATTA ATCCTGGCAC TATTTATCGT 
AACCTGCCGA TGGCCCGGCT GGTTGAAATA GCCCTGGCCC GGGGAGAAGG CCTCCTAGCT
TCCAATGGGG CTTTAAGCGT TAATACCGGC AAGTACACCG GCCGTTCCCC CCACGACAGG
TATATCGTGG ACACTCCCGC CGTTCACGAC AGCATCAGTT GGGGTGCCGT CAACCAGCCC
GTGAGCGAGG CGACCTTTGA ACGCCTCTAC AGTCGCCTGA CCGCCTACCT CCAGGGAAAG
GATCTCTTCG TTTTTGACGG CTTTGTCGGG GCCGATCCTG CCTACCGCAT GCCTATCAGG
ATTGTCAATG AGTATGCCTG GCAGAACCTG TTTGTCCACC AGCTTTTTAT CAGGCCGACG
GCGGAAGAAC TGGCCGGGCA TGAGCCCCGG TTTACCGTTA TCTGCGCTCC GGGCTTCAAG
GCCATTCCGG AGGAAGACGG TACCCGTTCC GAAGCCTTTA TTATTTTAAA CTTTGACCGG
CGGCTGGTGA TCATCGGCGG TACCTCCTAT GCCGGCGAGA TGAAAAAATC CATTTTTACC
GTCATGAATT ATTTGTTGCC AGAGCAAGGT GTTTGCCCCA TGCACTGCTC GGCTAACATG
GGCCCGGCAG GCGATACGGC CCTGTTCTTC GGTCTTTCCG GTACCGGCAA GACTACCCTG
TCGGCCGATC CGGAACGCTA CCTTATTGGC GACGATGAGC ATGGATGGTC GGACAAGGGC
ATTTTTAACT TTGAAGGCGG TTGTTATGCT AAGTGCATCA AGCTCTCCGC CGAGCATGAA
CCCCAGATCT GGAATGCCAT CCGTTTCGGC AGCGTCCTGG AGAATGTGAT GGTAGACCCC
GATTGCCGAA TCATTGACTA CGACAGCGAT GCCCTGACGG AAAACACCCG CGCTGCCTAC
CCGGTAGATT TTATCCCTAA CGCCGTCATC CCCGGGGTGG GTGGCCATCC CCAGACGGTG
GTTTTTCTCA CCGCTGACGC CTTTGGCGTT ATGCCGCCGA TAGCCAAACT CACCCGGGAA
CAGGCCATGT ACTATTTCCT GTCCGGTTAT ACCAGCAAGC TAGCCGGTAC CGAGCGGGGG
GTTACCGAGC CCAAGGCGAC TTTCTCGACT TGTTTCGGGG CACCCTTCCT GCCTCGGTCG
CCCATGGTTT ACGCCAACCT CCTGGGGGAA AGGATAGCCA GGCATAACGC CAGCGTTTAC
CTGGTCAATA CCGGCTGGAC AGGGGGGCCC TATGGCACTG GCCGGCGTAT GAGCCTGCCC
TATACTCGGG CCATGGTCAG GGCGGCTTTA AACGGTGAAC TGGATAAGGT GGAATTTACC
CCCGACCCTG TTTTCGGCTT CCTGGTACCT AAAGCCTGCC CCGGAGTCCC GGCTGAAATT
CTCAATCCAC GCAACACCTG GGCAGAAACG GAAAAATATG ATGCCATGGC TCGCAAGCTA
GCCAGCCTCT TCAGGGAGAA CTTTGCCAAA TTTAAGGACG TACCGGTCAG CATCCAGGAG
GCCGGAGTGG TTGGTTGA
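
The GC content field in the Gene Information table is presumably the fraction of G and C bases in this gene sequence. A minimal Python sketch of that calculation follows, assuming the sequence is pasted in the space-separated 10-base blocks used here; `gc_count` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used as input.

```python
def gc_count(sequence: str) -> tuple[int, int]:
    """Return (number of G/C bases, total bases), ignoring whitespace."""
    seq = "".join(sequence.split()).upper()
    gc = sum(1 for base in seq if base in "GC")
    return gc, len(seq)


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C."""
    gc, total = gc_count(sequence)
    if total == 0:
        raise ValueError("empty sequence")
    return gc / total


# First row of the gene sequence only; a single row is not
# representative of the whole gene.
first_row = "ATGAGTAATA CTTATGGTTT AGAGCGATTG GGCATCATTA ATCCTGGCAC TATTTATCGT"
print(gc_count(first_row))  # (22, 60)
```

Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.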
 
Protein sequence
MSNTYGLERL GIINPGTIYR NLPMARLVEI ALARGEGLLA SNGALSVNTG KYTGRSPHDR 
YIVDTPAVHD SISWGAVNQP VSEATFERLY SRLTAYLQGK DLFVFDGFVG ADPAYRMPIR
IVNEYAWQNL FVHQLFIRPT AEELAGHEPR FTVICAPGFK AIPEEDGTRS EAFIILNFDR
RLVIIGGTSY AGEMKKSIFT VMNYLLPEQG VCPMHCSANM GPAGDTALFF GLSGTGKTTL
SADPERYLIG DDEHGWSDKG IFNFEGGCYA KCIKLSAEHE PQIWNAIRFG SVLENVMVDP
DCRIIDYDSD ALTENTRAAY PVDFIPNAVI PGVGGHPQTV VFLTADAFGV MPPIAKLTRE
QAMYYFLSGY TSKLAGTERG VTEPKATFST CFGAPFLPRS PMVYANLLGE RIARHNASVY
LVNTGWTGGP YGTGRRMSLP YTRAMVRAAL NGELDKVEFT PDPVFGFLVP KACPGVPAEI
LNPRNTWAET EKYDAMARKL ASLFRENFAK FKDVPVSIQE AGVVG
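
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds a codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). It is an illustration under those assumptions, not the database's own pipeline: it does not handle alternative start codons, which is not an issue here because the gene begins with ATG, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG order for first, second, and third codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAGTAATA CTTATGGTTT AGAGCGATTG"))  # MSNTYGLERL
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (GTT GGT TGA) translate to the closing residues VG followed by a stop.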