Gene AFE_0897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAFE_0897 
SymbolpheA 
ID7136751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 23270 
KingdomBacteria 
Replicon accessionNC_011761 
Strand
Start bp813689 
End bp814765 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content61% 
IMG OID643529297 
Productchorismate mutase/prephenate dehydratase 
Protein accessionYP_002425375 
Protein GI218665183 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01801] chorismate mutase domain of gram positive AroA protein
[TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.255743 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACC CGGAACTGGC GGCCCTGCGT AAGGCCATCG ATCAGGTGGA TCAGCAGTTC 
CTGCAACTGC TCGGCGAGCG GGGCCGGCTT GCGCAGCAGG TGGGTGCCGT CAAACAGGCT
GCGGGCGAAG TGAATTTTTA CCATCCAGAC CGTGAGTCGG AGATTCTCCG TCGGGTCATG
GCCGATAACC CTGGGCCCTT TTCCAGTGAA CAGGTTGCCA TCATTTTCCG GGAGATCATC
TCTGCAGGCC TGGCCCTGGA GCAACCCCTG CAAGTGGCCT ATCTTGGACC CGCCGGCACG
TTTTCGCAGA TCGCGGCGCA GAAGCATTTC GGGCGCGCGG CCGTTCTGCA GCCCACCGCG
GGGATCGCCG AGATTTTCCG CCTGGTGGAC AGTGACCAGG CCCGGTTCGG TGTGGTGCCG
GTGGAGAACA GCACTGAAGG TTCCGTCAAT CTCAGTCTGG ATCTGCTCCT GGATTACCCC
TTGCAGATCT GCGGCGAGGT CCAGTTACGC ATCGTCCATA ATCTGGTGGC CAAGGTGCCC
ATCTCCACCG TTCGCCGTGT CTACGTTCAT TATCAGACCA GGGCCCAGTG CCGTCAGTGG
CTCGCGACCC ATTTGCCGCA GGCGGAATTG GTGGATGTGG CCAGCAACGC GGTTGCCGCG
GAACGGGCTG CGACAGATGC CGATGGCAGC GCCATTTCCA CGACCCTCGC CGCGGAAGCG
TACGGCCTCG ACATTCTGGT CGCGGGGATC GAAGACAACC CGGAGAACAC CACCCGTTTT
CTGATCATTG GCAAAATCCA TACGCGACCT ACGGGGAATG ACAAGACCAG CCTGGTGGTA
GCCGGCGCCA ATCGTCCGGG GAGTCTGCAT GCGTTGCTGT CACCGCTGGC CGACGCGGGC
ATCAGTCTGA CGCGCATCGA GTCACGGCCG GCACGCTCGG CCATCTGGGA GTACGTCTTT
TATCTCGACT TGCTTGGCCA TTGTCAGGAT GCCGCCATCG CTCCGGTGCT GGATGTTCTC
GCGCAACAGG CATCCTTTTG CCGTTGTCTC GGCAGTTATC CCCGGGCGGT ATTTTGA
 
Protein sequence
MKNPELAALR KAIDQVDQQF LQLLGERGRL AQQVGAVKQA AGEVNFYHPD RESEILRRVM 
ADNPGPFSSE QVAIIFREII SAGLALEQPL QVAYLGPAGT FSQIAAQKHF GRAAVLQPTA
GIAEIFRLVD SDQARFGVVP VENSTEGSVN LSLDLLLDYP LQICGEVQLR IVHNLVAKVP
ISTVRRVYVH YQTRAQCRQW LATHLPQAEL VDVASNAVAA ERAATDADGS AISTTLAAEA
YGLDILVAGI EDNPENTTRF LIIGKIHTRP TGNDKTSLVV AGANRPGSLH ALLSPLADAG
ISLTRIESRP ARSAIWEYVF YLDLLGHCQD AAIAPVLDVL AQQASFCRCL GSYPRAVF