Gene Lferr_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1023 
Symbol 
ID6876992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp991956 
End bp993257 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content64% 
IMG OID642788901 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_002219472 
Protein GI198283151 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00127553 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000467084 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGACTCGAT ATCTCGTTCG TCCAGGCAGT CGGCTTGCCG GCCGTTTCCC GGTCCCGGGA 
GACAAATCCA TTTCCCACCG TGCGGTCATT CTCGGGGCGC TTGCTGAAGG TGTCACTGAA
GTGGAGGGGT TGCTGGAGGG CGCCGATGTG CTGGCCACCA TCGCCGCTTT TCGCAGCATG
GGGGTGCAGA TGGAAGGCCC CGATAAAGGG CATCTGCGCA TTCATGGGGC TGGCCTGCAG
GGGCTGCGCG CGCCCGTTGT TCCTCTGGAT TGCGGGAATT CCGGTACCGC CATGCGTCTC
TTGGCGGGAG TGCTGGCCGG ACAGCCTTTC CCCAGCACCC TGGTTGGCGA CGCCAGTTTG
CAGAAGCGGC CCATGGGCCG CATTCTGAAC CCGTTGCGTG CCATGGGCGC GGAAATCGCC
GCCCAGGATG GCAGGGCACC TTTACATATT CATGGGCGGC CCCTGCACGG CATCGACTAT
GCCCTGCCGG TGGCGAGCGC CCAGGTCAAA TCCGCCGTGT TGTTGGCCGG ACTGTATGCC
GACGGACAAA CCTGCGTGAC CGAGCCCGCA CCCACCCGCG ATCACAGCGA GCGGATGCTG
CAAGGTTTTG GCCAACCGGT GGAGCGTCAT GGCCCGCGCG CCTGCCTGCG CGGCGGTGGC
CGGTTGTGCG GGCAGGCGCT GCAGGTGCCG GGCGATATTT CGTCGGCCGC GTTCTTCCTG
CTCGGCGCCA CCATCGCGCC GGGCTCCGAT CTCACCCTGG AAGGAGTTGG TATCAATCCG
ACCCGGACCG GTATCATCGA AATCCTCACC CGCATGGGCG CCCGGATCGA TCTGACGGCC
TTGCGTGAAG TGGGCGGTGA GCCAGTCGCT GATATCCGCG TTCGCTATGC ACCATTGCAA
GGTATCGCTA TCCCACCTCG GCTGGTACCT CTGGCTATCG ATGAATTCCC CGCGCTGTTC
ATAGCGGCGG CGTGTGCGAA GGGGCAGACG GTGATTACCG GTGCCGAGGA ACTCCGTGTC
AAGGAAAGCG ACCGCATCGC GGTAATGGCT GGAGGGCTGC GCGCGCTGGG TGCTACGGTA
GAAGAGCGTG TGGATGGCGC CATTATCAGC GGATCAGCGC TGCTGGGCGG CCGGGTGGAC
AGTCATGGGG ATCATCGTAT TGCGATGGCT TTTGCCATGG CCGCACTGGT GGCGCAGGGG
GATATGGAAA TCCTGGACTG TGCCAATGTG GCGACGTCCT TTCCGAGTTT TCCCGCGCTG
GCGCAGCAGG CGGGGCTGCT GCTGGAGGTG GCAAGCGCAT GA
 
Protein sequence
MTRYLVRPGS RLAGRFPVPG DKSISHRAVI LGALAEGVTE VEGLLEGADV LATIAAFRSM 
GVQMEGPDKG HLRIHGAGLQ GLRAPVVPLD CGNSGTAMRL LAGVLAGQPF PSTLVGDASL
QKRPMGRILN PLRAMGAEIA AQDGRAPLHI HGRPLHGIDY ALPVASAQVK SAVLLAGLYA
DGQTCVTEPA PTRDHSERML QGFGQPVERH GPRACLRGGG RLCGQALQVP GDISSAAFFL
LGATIAPGSD LTLEGVGINP TRTGIIEILT RMGARIDLTA LREVGGEPVA DIRVRYAPLQ
GIAIPPRLVP LAIDEFPALF IAAACAKGQT VITGAEELRV KESDRIAVMA GGLRALGATV
EERVDGAIIS GSALLGGRVD SHGDHRIAMA FAMAALVAQG DMEILDCANV ATSFPSFPAL
AQQAGLLLEV ASA