Gene Achl_3100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3100 
Symbol 
ID7294580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp3443140 
End bp3444243 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content70% 
IMG OID643591510 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_002489150 
Protein GI220913841 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG CAACAGCCGC AGCAGCACAA TCCACCTCGA ACCTGCGCGT CAGCGAATTC 
ACCCCGCTGC CCACCCCTTC CGAACTGATC GCGGACCTGC CCCTCGACGC ACAGGCCGCT
GCCGTCGTCG AACGCGGCCG CGATGAAGTC CGCGCCATCA TGGACGGCGT GGACGACCGC
CTGCTGGTGA TCGTGGGACC GTGCTCCATC CACGATCCCA AGGCCGGGCT GGAATACGCC
CGCCGGCTGG TCAGCCAGGC TGAGAAGCAC AAGGAAGACC TGCTGATCGT CATGCGGACC
TACTTCGAGA AGCCCCGCAC CACCGTTGGC TGGAAGGGCC TGATCAACGA TCCGCGGCTG
GACGGCAGCC ACGACATGGT CACCGGCCTG CGGACCGCAC GCCACTTCCT CCAGCAGGTC
ACCGCCCTGG GACTGCCGAC GGCCACCGAG TTCCTCGAAC CGATCAGCCC GCAGTACATG
GCGGACCTCA TCTCCTGGGG CGCCATCGGG GCCCGCACCA CGGAGAGCCA GATCCACCGC
CAGCTGGCAT CCGGCCTGTC CATGCCCATC GGCTTCAAGA ACGGGACCGA CGGCGGCCTG
CAGGTTGCCA TCGACGCCTG CGGTGCCGCC GCGGCAGCCC AGGCGTTCCT GGGGATCGAC
GGCGACGGCC GGGCCGCGCT GGTGGCCACC GCCGGCAACC CGGACACGCA CGTCATCCTC
CGCGGCGGGC GCAAGGGGCC CAACTACTCC ACGGCAGACG TCGAAGCGGC CTCCGCCACC
CTGGCCGGCA AGGGGCTGAA CCCGCGCCTG ATCGTGGACG CCAGCCACGC CAACAGCGGC
AAGAGCCACC ACCGGCAGGC GGAAGTGGCC CTGGAAATCG GTGCACAGCT TGAAGAAGGC
GGCCCGGCCG CCCAGGCGAT CGCCGGCGTC ATGCTGGAAA GCTTCCTGGT GGGAGGCGCC
CAGAACCTGG ACGTCGTGGA GCACGCGGCC GGCCGGGATG AGCTGGTCTA CGGGCAGAGC
GTCACGGATG CGTGCATGGA GTGGGACGTC TCGGCGTCGG TCCTGGAGCA GCTGGCCGCC
TCAGCCCGGA AGCGCCGCGG CTGA
 
Protein sequence
MSTATAAAAQ STSNLRVSEF TPLPTPSELI ADLPLDAQAA AVVERGRDEV RAIMDGVDDR 
LLVIVGPCSI HDPKAGLEYA RRLVSQAEKH KEDLLIVMRT YFEKPRTTVG WKGLINDPRL
DGSHDMVTGL RTARHFLQQV TALGLPTATE FLEPISPQYM ADLISWGAIG ARTTESQIHR
QLASGLSMPI GFKNGTDGGL QVAIDACGAA AAAQAFLGID GDGRAALVAT AGNPDTHVIL
RGGRKGPNYS TADVEAASAT LAGKGLNPRL IVDASHANSG KSHHRQAEVA LEIGAQLEEG
GPAAQAIAGV MLESFLVGGA QNLDVVEHAA GRDELVYGQS VTDACMEWDV SASVLEQLAA
SARKRRG