Gene AFE_3042 details

Gene Information

Locus tag: AFE_3042
Symbol: hisC-1
ID: 7134302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 23270
Kingdom: Bacteria
Replicon accession: NC_011761
Strand:
Start bp: 2732855
End bp: 2733934
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 62%
IMG OID: 643531393
Product: histidinol-phosphate aminotransferase
Protein accession: YP_002427409
Protein GI: 218667068
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
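
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2732855
end_bp = 2733934
gene_length_bp = 1080
protein_length_aa = 359

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# so the protein has one residue fewer than the number of codons.
expected_residues = span // 3 - 1

print(span, span % 3, expected_residues)  # prints "1080 0 359"
```

Under these assumptions the span matches the reported gene length, divides evenly into codons, and gives the reported protein length.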


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGACA GCGCTACGAC CGCCCTGATG CAGGGCCTGC TCCGCCCGGA ATTGCTGGCC 
AGCAAGGCCT ATGCGGTGGC AGACGGCGAG GGGCTCATCA AACTGGATGC CATGGAGAAC
CCCTATGGCT TACCGGCGGC CTTGCGTGAG CAATGGCTGG AGAGCCTGAC CGACGCGCCC
CTCAATCGCT ATCCCGACGC ACACCCGACC CTCCTCATGG AGGGGCTCAA GGCCCACATC
GGCCTGCCTG CCGGAATAGA ACTCATGCTC GGTAACGGCT CCGATGAGCT GATCCAGATT
CTGGTGACCG CAGTAGCAGG CAGCCGACGC CCCATCATGG CGGTAGACCC CAGTTTCGTC
ATGTACCGGC TGCTGGCGCA GCAGCTTGGT CTGCCTTTTG TGGGTATTCC CCTGGATGCG
GACTTCCAGC TCGACCTTCC GGCCATGCTG GCGGCCATCG CCGCGCAACA ACCCGCCATC
ATTTTTCTCG ACTGGCCCAA CAATCCCAGT GGCAGCCTTT TCCCCGAGAC CGATCTGGAG
GCCATTGTCG CTGCAGCGCC GGGCCTGGTG GTGGTGGATG AGGCCTATCA CGCCTTCAGT
CAGAAGACCT TTGCCGATCA CCTGGGACGC ACCCCCAACC TCCTCTTGCT GCGCACCATG
TCCAAGGAGG GGCTGGCGGG GATGCGGCTG GGAATGCTGG CGGGGCCCGC CGCATGGATT
CAGGAACTGG ACAAGCTGCG CCTGCCTTAC AATATCAATG TACTCACCCA GCGCAGCGCG
TCCTTCTACC TGCGTCACAC CGAAGTACTG AATGCCCAGG CCGAAATTCT GCGTGTCGAG
AGGGAACGGC TCTTCAAGGC CATCCGTGCC TGCGGCCTTT CGGTCTGGCC CAGCGCCGCC
AATTTCCTGC TCTTTCATGC CCCGGGGCGG GCAGCGGTGC TGTTCTCAGG CCTGCGCGCG
GGTGGAGTGC TCATCAAAGC CTTTACAGGC CACCCCCGCC TCGGCGAATA TCTACGGGTC
AGTGTCGGCA CTCCCGCTGA AAATGACCGC TTTCTGGCCG TATTGGAGTC CTTACTGTGA
 
Protein sequence
MSDSATTALM QGLLRPELLA SKAYAVADGE GLIKLDAMEN PYGLPAALRE QWLESLTDAP 
LNRYPDAHPT LLMEGLKAHI GLPAGIELML GNGSDELIQI LVTAVAGSRR PIMAVDPSFV
MYRLLAQQLG LPFVGIPLDA DFQLDLPAML AAIAAQQPAI IFLDWPNNPS GSLFPETDLE
AIVAAAPGLV VVDEAYHAFS QKTFADHLGR TPNLLLLRTM SKEGLAGMRL GMLAGPAAWI
QELDKLRLPY NINVLTQRSA SFYLRHTEVL NAQAEILRVE RERLFKAIRA CGLSVWPSAA
NFLLFHAPGR AAVLFSGLRA GGVLIKAFTG HPRLGEYLRV SVGTPAENDR FLAVLESLL
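
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence codon by codon. The sketch below is a minimal illustration in plain Python, not the pipeline used to produce this record. It relies on the fact that translation table 11 assigns amino acids to codons in the same way as the standard code (the tables differ in permitted start codons). The helper names `clean` and `translate` are hypothetical, and only the first 60-base line of the gene is used as sample input.

```python
# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence and the first two protein blocks, as displayed.
first_gene_line = "ATGAGCGACA GCGCTACGAC CGCCCTGATG CAGGGCCTGC TCCGCCCGGA ATTGCTGGCC"
first_protein_blocks = "MSDSATTALM QGLLRPELLA"

print(translate(first_gene_line) == clean(first_protein_blocks))  # prints "True"
```

Passing the full 1080-base gene sequence to `translate` in the same way allows the result to be compared against the complete protein sequence.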