Gene Moth_0563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0563 
SymbolproA 
ID3831463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp585769 
End bp587025 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content60% 
IMG OID637828504 
Productgamma-glutamyl phosphate reductase 
Protein accessionYP_429436 
Protein GI83589427 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0014] Gamma-glutamyl phosphate reductase 
TIGRFAM ID[TIGR00407] gamma-glutamyl phosphate reductase 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATGTAG CCCTGGAGGT GCAGACCAAA GGCCAGAGAG CCAGAGAAGC CGCCCGGATC 
CTGGCCGGCC TGGGAACCAG CAAGAAAAAC GAGGCCTTAC TGGCTATGGC CCGGGCCCTG
GAGGAAGAAC AGGAAGCCAT CCTGGCCGCC AACGCCAGGG ACATGGTCGC CGGAAAAGAA
AAGGGCCTTT CCCGGGCCCT CCTGGACCGC CTTCTCCTCA ATGAGAAGCG CATCAGGGAT
ATGGCTGCCG GCCTGCGGGA ACTGGCTGCT CTGCCGGACC CCGTGGGCGA GGTGACCTCC
ATGTGGACCC GGCCCAACGG CCTGCAGATT GGCCGGGTAC GGGTGCCCCT GGGGGTTATC
GGTATTATTT ACGAGGCTCG GCCCAATGTC ACCGTCGATG CCGCCGGGCT CTGTCTAAAA
ACCGGCAATG CCGTCATCCT GCGGGGCGGG TCCGAGGCCT TTTATTCCAA CCAGGCCCTA
ACCCGTGTCA TCAGCCGGGC GGCAACGGCT GCCGGAGCGC CGGAGGGGGC CATCCAATTA
ATCGAGACCA CCGACCGGGA AGCTGTAAAT CTTTTATTGC GGGCCAATGA TTACCTGGAT
GTTTTGATTC CCAGGGGCGG GGCCGGCCTG ATCCGGACCG TGGTAGAAAA CGCCACCGTG
CCCGTCATTG AAACCGGTGT GGGGAACTGC CACGTCTATG TCGACGCCGA AGCCGACCTG
GATATGGCCC AGAGGATTGT CATTAACGCC AAGACCCAGC GTCCGGGTGT TTGTAACGCC
ATGGAAACTC TGCTGGTCCA TGAAAAGGTG GCGGACTCCT TTCTCCCCTC CCTGGCCGCG
GCTTTAAAGG AAAAGGGAGT CACCATCCGG GGCTGTGAAC GTACCCGGGC CATCATACCC
TGGGCGGAAG TTGCCACCGA AACCGACTGG GCCACTGAGT ACCTGGATCT CATCCTGGCC
ATAAGGGTTG TCGACTCCCT TGAGAGCGCC CTGGAGCATA TCCATCGTTA CGGCACCAAA
CACTCGGAAG CCATTGTTAC GACCAACTAC CAGACGGCCC GGGAATTCCT GGCCCGGGTG
GATGCGGCGG CCGTATACGT CAATGCCTCA ACGCGTTTTA CCGATGGCTA CGAGTTCGGT
TTCGGGGCCG AGATTGGTAT CAGTACCCAG AAACTCCATG CCCGTGGTCC CATGGGGCCG
GAACAACTAA CAACTTTTAA GTATATTATT TTTGGTAGTG GACAGATCCG CCAGTAA
 
Protein sequence
MNVALEVQTK GQRAREAARI LAGLGTSKKN EALLAMARAL EEEQEAILAA NARDMVAGKE 
KGLSRALLDR LLLNEKRIRD MAAGLRELAA LPDPVGEVTS MWTRPNGLQI GRVRVPLGVI
GIIYEARPNV TVDAAGLCLK TGNAVILRGG SEAFYSNQAL TRVISRAATA AGAPEGAIQL
IETTDREAVN LLLRANDYLD VLIPRGGAGL IRTVVENATV PVIETGVGNC HVYVDAEADL
DMAQRIVINA KTQRPGVCNA METLLVHEKV ADSFLPSLAA ALKEKGVTIR GCERTRAIIP
WAEVATETDW ATEYLDLILA IRVVDSLESA LEHIHRYGTK HSEAIVTTNY QTAREFLARV
DAAAVYVNAS TRFTDGYEFG FGAEIGISTQ KLHARGPMGP EQLTTFKYII FGSGQIRQ