Gene Afer_0792 details


Gene Information

Locus tag: Afer_0792
Symbol:
ID: 8322854
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidimicrobium ferrooxidans DSM 10331
Kingdom: Bacteria
Replicon accession: NC_013124
Strand:
Start bp: 801844
End bp: 803199
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 67%
IMG OID: 644951927
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_003109413
Protein GI: 256371589
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II
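
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the coordinates are 1-based and inclusive (the record does not state this) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 801844
END_BP = 803199
GENE_LENGTH_BP = 1356
PROTEIN_LENGTH_AA = 451


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1356
print(expected_protein_length(GENE_LENGTH_BP))  # 451
```

Both results agree with the Gene Length (1356 bp) and Protein Length (451 aa) values listed in the record.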


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.616479
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCGC TTGACGACGC CGGTTGGCAG CCGTGGGACT GGCAGCAGCG TGTGGCCGCG 
CAGCAGCCCG AGTGGCCCGA TCCCGAGGCG TTGGACGCGG TCATCAAGGA GTTGGCTCAG
CGCCCAGCCC TCGTGGTCGC CAAGGACGTC GACCGGCTAC GAGCCGCACT GGCGCGTGCG
GCCCGTGGAC GTGCCTTCGT GCTCCAAGCC GGTGACTGCG CGGAGAGCTT CCACGATCAC
TCCGCGAGTT CGCTGCGCGC CAAGCTGAAG ATCATCTTGC AGATGGCCGT GGTCCTGACC
TACTCGTCGG GCGTGTCGGT GGTCAAGATC GGGCGTATCG CCGGCCAGTT CGCCAAGCCG
AGGTCGGCGC CCGTCGAGGT CGTCGACGGG GCGACCTTGC CGTCGTTTCG CGGGCACATC
GTCCATGATG ACGCCCCGAC ACTGGACGCG CGTCGTCCCA ACCCCGAACG CCTACTCTGG
GCCTACGATC AGTCCCGAGC GACGGTGAGC GTGTTGCGAG CCCTCACCGA GGGGGGCTTC
GCGGATCTCT CCGGGGCGCA TCGGTGGAAC CTCGACTTCG TCGCCTCGTC GCCGGAAGGG
CAGCGCTACC AGGCGATCGC CGATGGGGTC GATCGGGCAC TTCGCTTCAT GGCAGGCTGC
GGGATCGATC TCGAGCGCGA GGCCGTTCTG CACCAAGTGA ACGTGTGGAC CTCGCACGAG
GCGCTCTTAC TGCCCTACGA GGCGGCGCTC ACCCGGCGCG ATCCTGCCTC GCAGCGCTAC
TACGACCTCT CGGCCCACAT GGTCTGGGTG GGCGAGCGCA CGCGCCAGCT CGACGGCGCA
CACCTGCGTT TTGCATCGGG GATCGCCAAT CCGGTCGGAC TGAAGGTCGG CCCCACGATG
GAGCCCGACA CCCTCGTCGA GGCCTGCCGG ATCCTCGATC CTGATCGGAC GCCGGGTCGG
CTGGTGCTCA TCTCACGCAT GGGTCACGAC GCCGTGCGTG ATCGCCTCGG AGGTCTCGTC
GAGGCCGTAC GTGAGGCGGG CTATCCGGTG GTGTGGCTGT GCGATCCGAT GCACGGCAAC
ACCTTCGTCT CCCAGTCAGG CTACAAGACG CGCCGCTTCG AGGACGTGAT GGACGAGATC
GCCGGTTTCT TCGAGGTCCA TCGACGGCTT GGTACCCACG CAGGTGGGAT CCACCTCGAG
CTCACCGGAG AGGACGTGAC GGAGTGTCTT GGCGGCTCGG AGGCGGTGCT CGAGTCGGAA
CTGTGTCGTG CCTACGACAC CATCTGCGAT CCTCGGTTGA ACGCGCGCCA ATCACTCGAC
TTGGCGTTTC GCGTCGCTGA ACTCCTGATC CGCTGA
 
Protein sequence
MNALDDAGWQ PWDWQQRVAA QQPEWPDPEA LDAVIKELAQ RPALVVAKDV DRLRAALARA 
ARGRAFVLQA GDCAESFHDH SASSLRAKLK IILQMAVVLT YSSGVSVVKI GRIAGQFAKP
RSAPVEVVDG ATLPSFRGHI VHDDAPTLDA RRPNPERLLW AYDQSRATVS VLRALTEGGF
ADLSGAHRWN LDFVASSPEG QRYQAIADGV DRALRFMAGC GIDLEREAVL HQVNVWTSHE
ALLLPYEAAL TRRDPASQRY YDLSAHMVWV GERTRQLDGA HLRFASGIAN PVGLKVGPTM
EPDTLVEACR ILDPDRTPGR LVLISRMGHD AVRDRLGGLV EAVREAGYPV VWLCDPMHGN
TFVSQSGYKT RRFEDVMDEI AGFFEVHRRL GTHAGGIHLE LTGEDVTECL GGSEAVLESE
LCRAYDTICD PRLNARQSLD LAFRVAELLI R
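
As a hedged illustration of how the gene sequence relates to the protein sequence and the GC content field, the sketch below translates a space-separated DNA block and computes its GC percentage. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate` and `gc_percent` are hypothetical, and only the first sequence line is used as sample input.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest,
# third base fastest). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGAACGCGC TTGACGACGC CGGTTGGCAG CCGTGGGACT GGCAGCAGCG TGTGGCCGCG"
print(translate(first_line))  # MNALDDAGWQPWDWQQRVAA
```

The printed peptide matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.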