Gene Lferr_1019 details

Gene Information

Locus tag: Lferr_1019
Symbol:
ID: 6876988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 987879
End bp: 988955
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 61%
IMG OID: 642788897
Product: chorismate mutase
Protein accession: YP_002219468
Protein GI: 198283147
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0077] Prephenate dehydratase
        [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01801] chorismate mutase domain of gram positive AroA protein
            [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2
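
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 987879
end_bp = 988955
protein_length_aa = 358

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(gene_length_bp)       # 1077
print(expected_protein_aa)  # 358
```

Under these assumptions the computed values agree with the listed Gene Length (1077 bp) and Protein Length (358 aa).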


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.789998
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000000373166
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAAAAACC CGGAACTGGC GGCCCTGCGT AAGGCCATCG ATCAGGTGGA TCAGCAGTTC 
CTGCAACTGC TCGGCGAGCG GGGCCGGCTT GCGCAGCAGG TGGGTGCCGT CAAACAGGCT
GCGGGCGAAG TGAATTTTTA CCATCCAGAC CGTGAGTCGG AGATTCTCCG TCGGGTCATG
GCCGATAACC CTGGGCCCTT TTCCAGTGAA CAGGTTGCCA TCATTTTCCG GGAGATCATC
TCTGCAGGCC TGGCCCTGGA GCAACCCCTG CAAGTGGCCT ATCTTGGACC CGCCGGCACG
TTTTCGCAGA TCGCGGCGCA GAAGCATTTC GGGCGCGCGG CCGTTCTGCA GCCCACCGCG
GGGATCGCCG AGATTTTCCG CCTGGTGGAC AGTGACCAGG CCCGGTTCGG TGTGGTGCCG
GTGGAGAACA GCACTGAAGG TTCCGTCAAT CTCAGTCTGG ATCTGCTCCT GGATTACCCC
TTGCAGATCT GCGGCGAGGT CCAGTTACGC ATCGTCCATA ATCTGGTGGC CAAGGTGCCC
ATCTCCACCG TTCGCCGTGT CTACGTTCAT TATCAGACCA GGGCCCAGTG CCGTCAGTGG
CTCGCGACCC ATTTGCCGCA GGCGGAATTG GTGGATGTGG CCAGCAACGC GGTTGCCGCG
GAACGGGCTG CGACAGATGC CGATGGCAGC GCCATTTCCA CGACCCTCGC CGCGGAAGCG
TACGGCCTCG ACATTCTGGT CGCGGGGATC GAAGACAACC CGGAGAACAC CACCCGTTTT
CTGATCATTG GCAAAATCCA TACGCGACCT ACGGGGAATG ACAAGACCAG CCTGGTGGTA
GCCGGCGCCA ATCGTCCGGG GAGTCTGCAT GCGTTGCTGT CACCGCTGGC CGACGCGGGC
ATCAGTCTGA CGCGCATCGA GTCACGGCCG GCACGCTCGG CCATCTGGGA GTACGTCTTT
TATCTCGACT TGCTTGGCCA TTGTCAGGAT GCCGCCATCG CTCCGGTGCT GGATGTTCTC
GCGCAACAGG CATCCTTTTG CCGTTGTCTC GGCAGTTATC CCCGGGCGGT ATTTTGA
 
Protein sequence
MKNPELAALR KAIDQVDQQF LQLLGERGRL AQQVGAVKQA AGEVNFYHPD RESEILRRVM 
ADNPGPFSSE QVAIIFREII SAGLALEQPL QVAYLGPAGT FSQIAAQKHF GRAAVLQPTA
GIAEIFRLVD SDQARFGVVP VENSTEGSVN LSLDLLLDYP LQICGEVQLR IVHNLVAKVP
ISTVRRVYVH YQTRAQCRQW LATHLPQAEL VDVASNAVAA ERAATDADGS AISTTLAAEA
YGLDILVAGI EDNPENTTRF LIIGKIHTRP TGNDKTSLVV AGANRPGSLH ALLSPLADAG
ISLTRIESRP ARSAIWEYVF YLDLLGHCQD AAIAPVLDVL AQQASFCRCL GSYPRAVF
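
The relationship between the two sequences above, and the GC content field, can be reproduced with a short script. This is a minimal sketch, not the method used to build this record: the helper names `clean`, `gc_content` and `translate` are hypothetical, and the codon table is the standard NCBI amino-acid assignment, which translation table 11 shares for non-start codons. Only the first line of the gene sequence is used as input, to keep the example short.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
fragment = "ATGAAAAACC CGGAACTGGC GGCCCTGCGT AAGGCCATCG ATCAGGTGGA TCAGCAGTTC"
print(translate(fragment))  # MKNPELAALRKAIDQVDQQF
print(round(gc_content(fragment), 2))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions should, under the assumptions stated above, give the complete protein and a GC fraction comparable to the listed value.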