Gene Lferr_1021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1021 
Symbol 
ID6876990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp990059 
End bp991075 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content59% 
IMG OID642788899 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_002219470 
Protein GI198283149 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000377176 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGATCGTTA TCGTCAAACC GCAAGCCACC GCAGAGCAGA TGGAGCATCT GTTGGAGCGT 
ATCCGCCAGT ACGGTCTGCA GCCCATGGTC AGTACCGGCT CTGAGCGCAC CGTGGTGGGC
CTGTTGGGGG ACGAGCGCTT GCTGCCTGAC GGCGCACTGG AATCCTTGCC CGCCGTGGAA
CAGGTCATGC CCATTCTCAA GCCCTACAAG CTCGTCAGCC GCGAGTTCAA ATCCACCGAT
ACGGTCATTG AAGTGCGGGG GATCCCCATC GGTGGCAGGC AGATTCAGGT GATCGCAGGC
CCCTGCTCGG TTGAAACGCC CGAGCAGATG CGCAGTTCCG CAGAAGCCGT CAAGGCGGCG
GGGTGCCGTC TGATGCGTGG CGGGGCATTC AAACCCCGTA CCAGTCCCTA CACTTTTCAG
GGCGTGGGTG ACGAGGGCTT GGATTATTTC CGGACCGCTG CAGATGCCGC CGGTCTGCCC
ATTGTTACCG AACTGATGGA TGTCCGCAAG ATTGATCTAT TTCTGGAAAA AGGAGTGGAC
ATCATCCAGA TCGGTACGCG CAACATGCAG AATTTTGATC TCCTCAAGGA AGTCGGTCGC
CTGGACGTGC CGGTCATCCT CAAGCGTGGC CTGAGCGCCA CCATAAAGGA ATGGCTCATG
GCGGCAGAAT ATATCGCCGC CCACGGCAAC CACAGGATTA TTTTTGCCGA ACGTGGCATC
CGTACCTTTG AAACGGCCTA TCGCAACGTC CTCGACGTCA CGGCCATACC GGTTCTCAAG
CGAGAGACCC ACCTGCCGGT CATCGTCGAT CCCAGTCACG CGGGTGGTAA AGCCTGGCTC
GTGCCGCAAT TATCCAAGGC GGCCATTGCG GCGGGTGCAG ATGGTTTGTT GGTGGAGTCG
CATCCCTGCC CCGAAGAAGC CTGGTGTGAT GCCGATCAGG CTCTGAGTCC GGAGCAACTC
ACGACGCTGA TGGGCGATCT GCGCAGGATT GCCGAGGCCA TTGGCCGCGA ACTCTAG
 
Protein sequence
MIVIVKPQAT AEQMEHLLER IRQYGLQPMV STGSERTVVG LLGDERLLPD GALESLPAVE 
QVMPILKPYK LVSREFKSTD TVIEVRGIPI GGRQIQVIAG PCSVETPEQM RSSAEAVKAA
GCRLMRGGAF KPRTSPYTFQ GVGDEGLDYF RTAADAAGLP IVTELMDVRK IDLFLEKGVD
IIQIGTRNMQ NFDLLKEVGR LDVPVILKRG LSATIKEWLM AAEYIAAHGN HRIIFAERGI
RTFETAYRNV LDVTAIPVLK RETHLPVIVD PSHAGGKAWL VPQLSKAAIA AGADGLLVES
HPCPEEAWCD ADQALSPEQL TTLMGDLRRI AEAIGREL