Gene Afer_0874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAfer_0874 
Symbol 
ID8322938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidimicrobium ferrooxidans DSM 10331 
KingdomBacteria 
Replicon accessionNC_013124 
Strand
Start bp894443 
End bp896053 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content71% 
IMG OID644952008 
ProductPEP-utilizing protein 
Protein accessionYP_003109492 
Protein GI256371668 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGAAC GTCACACCCT TCGCGGTCAA CCAGGCTCCC AGGGAGCCGG CGTCGGGACA 
GCCGTCCGCG TCGACGCGGT CGCCGAGCGC ACCGAGGTCG ATCCCGCCAG CGTGCGCCGA
GCGCTCGAGG AGGTCGCCGA CGACCTCGAG GCTTCCAGCC GACGTGCGAG CGGCGAGCTG
TCCCAGATCC TCGCCGCCGA CGCAGCCATC GCGCGAGACC CCATGCTCGT CGATGCCGTG
GAGCGTCACC TCGCCGACGA TCCCTCCACC GTCGGGGTCC ACGACGCCTT CGACGAGGTC
GCAGGCGTCC TCCGTTCCGT CGGCGGCGCG ATCGCCGAAC GAGCGGCGGA CCTCGGCGCG
ATCGAACGGC GCCTCATCGC CCGCCTCAGG GGCACCCAAG GACCGATGCT CGAAGGCAAG
GTCGTCGTCG CGAACGAGCT CGGCCCAGCA GATCTCCTCG CGATCGAGCA CGAACGGCCG
GCAGCGCTCT TGCTCGCTGG CGCCAGCCCG ACCGCCCACG TCGCCATCCT CGCCCGAGCA
CTCGGCATCC CCGCCCTCAC CGGCGTGGTG GGCCTCGACG GGGTGCACGA CGGCGACACC
GTGCTCGTCG ATACCGTGCG CGCGGTCGCG ATCGTGAACC CCAACGACGA CGATGTCACT
GGGCTGCGAG CCGCAGAGCG TACGCCCGCG CGAACGACGC TCCCTCGAGA CCGTGCCGCC
ATCGGCGCCG TCGCCATCAT GGCCAACGTC GCCGGCGTCG CCGACGCGCA GGGGGCGATC
GACGCCGGCG CGGTCGGTAT CGGCCTTTTG CGCACCGAGT TCTTGTTCCT CGACCGCGAC
GAGGCTCCCT CTCGCGCCGA GCAAGCCGAG GCCTACACCG AGATCCTCAC CCCATTCCGG
GGCCGCCGTT GCATCGTCCG CACGCTCGAC GCCGGTGCCG ACAAGCCACT CGCGTTCATC
GATCTGCCTC GCTCGGCCAA CCCCGCGCTC GGCGTCCGAG GATGGAGGGC ACGCGCCGTG
GCGCCGGCCG TGATCGACAC CCAGATCGCA GCGATCGCCG ACGCCCAGCG CGCGACCGGT
GCCGAGGTCG GTCTCATGGC CCCGATGGTG ACGACGATCG ACGAAGCGCG CGAGGTGGTC
GAGCGAGCCC ACGCCGCAGG GATCCCGAGT GCAGGCGTCA TGGTGGAGGT CCCTGCTCTG
TGCCTGCTCG GCGACGAGCT CGCCCGCAGC GTCGACTTCG TCTCGATCGG CACCAACGAC
CTCGCGCAGT ACCTCTTCGC CGCCGACCGC GAGGAGTCGG CGGTCGCAGC CCTCGCCGAT
CCCTTCTCGC CGCCACTCGC TCGACTCCTC GCTCGCCTCG TCGACGACGT CGACGGTCGC
ATCCCCATCG GTGTCTGCGG CGAGCTCGCC GCAGATCCCC TCGCCGCCGT CTGGCTGGCG
GGCCTCGGCA TCACGAGCTT GTCCATGACG CCGAGCGCGA TCGCACCCGT CACCCGTCTG
CTCGCTTCCG TCGAGCGTAC GACCGCTCGC CGAGCAGCAG AAGCGGTCCG CACCGCGAGC
GATGCTCAGC GTGCACGAGA CGCGGCGGCG CGTATCGTCG GCCTTGCCTA G
 
Protein sequence
MPERHTLRGQ PGSQGAGVGT AVRVDAVAER TEVDPASVRR ALEEVADDLE ASSRRASGEL 
SQILAADAAI ARDPMLVDAV ERHLADDPST VGVHDAFDEV AGVLRSVGGA IAERAADLGA
IERRLIARLR GTQGPMLEGK VVVANELGPA DLLAIEHERP AALLLAGASP TAHVAILARA
LGIPALTGVV GLDGVHDGDT VLVDTVRAVA IVNPNDDDVT GLRAAERTPA RTTLPRDRAA
IGAVAIMANV AGVADAQGAI DAGAVGIGLL RTEFLFLDRD EAPSRAEQAE AYTEILTPFR
GRRCIVRTLD AGADKPLAFI DLPRSANPAL GVRGWRARAV APAVIDTQIA AIADAQRATG
AEVGLMAPMV TTIDEAREVV ERAHAAGIPS AGVMVEVPAL CLLGDELARS VDFVSIGTND
LAQYLFAADR EESAVAALAD PFSPPLARLL ARLVDDVDGR IPIGVCGELA ADPLAAVWLA
GLGITSLSMT PSAIAPVTRL LASVERTTAR RAAEAVRTAS DAQRARDAAA RIVGLA