Gene Arth_3302 details

Gene Information

Locus tag: Arth_3302
Symbol:
ID: 4443996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3705998
End bp: 3707170
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 69%
IMG OID: 639691126
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_832778
Protein GI: 116671845
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
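
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that Start bp and End bp are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the record does not state either convention. The variable names are illustrative.

```python
# Values copied from the record.
start_bp = 3705998
end_bp = 3707170

# Assumption: both coordinates are 1-based and inclusive,
# so the span covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not
# contribute a residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length)      # 1173, matching "Gene Length"
print(protein_length)   # 390, matching "Protein Length"
```

Under those assumptions, the two computed values agree with the Gene Length and Protein Length fields.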


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCG CAGCAGCCAC CGAGACACAG CCTTCCACTT CCCCGGAAGC CAACGGTTCC 
CCGGAAGGCC CCGAGACCGC GCAGAAATCC ACCTCGAACC TGCGCGTCAG CGAATTCCAG
GCCCTGCCGT CCCCCCAGGA CATGATCGCT GCACTGCCGC TGGACGCCCG GGTGGCAGAC
GTCGTCGAAC GCGGCAGGGA CGAAGTCCGG GCCATCATGG ACGGCGTGGA CGACCGCCTG
CTGGTGATCG TGGGACCCTG CTCCATCCAC GACCCCAAAG CCGGCTTGGA GTACGCCCGC
CGGCTGGTGA GCCAGGCGGA GAAGCACAAG GAAGACCTGC TGATCGTCAT GCGGACTTAC
TTCGAGAAGC CGCGCACCAC GGTGGGCTGG AAGGGCCTCA TCAACGATCC CCACCTGGAT
GGCAGCCATG ACATCGCCAC CGGACTGCGG GCAGCACGCC AGTTCCTGAA GCAAGTCACG
TCGCTGGGCC TCCCCACCGC AACGGAGTTC CTGGAACCCA TCAGCCCGCA GTACATGGCC
GACCTCGTCT CCTGGGGCGC CATCGGTGCC CGCACCACCG AAAGCCAGAT CCACCGCCAG
CTGGCCTCCG GGCTGTCCAT GCCCATCGGA TTCAAGAACG GGACCGACGG CGACCTCCAG
GTTGCGGTCG ATGCCTGCAG CGCCGCCGCG GCATCTCAGG CCTTCCTGGG GATCGACGGC
GACGGCCGGG CCGCACTCGT GGCCACCGCC GGCAACCCGG ACACGCACGT GATCCTCCGC
GGCGGACGCA AGGGCCCCAA CTACTCCGCG GCGGATGTCG AAGCGGCCTC GGCGAAACTG
GCCGGCAAGC AGCTCAACCC CCGGCTGATC GTGGACGCCA GCCACGCCAA CAGCGGCAAG
AGCCACCACC GCCAGGCCGA AGTGGCGCTC GAAATCGGCG CGCAGCTGGA AGACGGCGGG
GCAGCGGCTG CGGCGATTGC CGGCGTCATG CTGGAGAGCT TCCTGGTGGG CGGGGCGCAG
AACCTCGATG TGGCCGAGCA CGCGGCCGGC ACCGGCGAAC TGGTCTACGG CCAGAGCGTG
ACGGATGCCT GCATGGAATG GGACGTCACG GCGTCGGTGC TCGGCCAGCT GGCAGCATCC
GCCCGCAAGC GGCGGGGCGC CCTGGAGGGC TGA
 
Protein sequence
MSTAAATETQ PSTSPEANGS PEGPETAQKS TSNLRVSEFQ ALPSPQDMIA ALPLDARVAD 
VVERGRDEVR AIMDGVDDRL LVIVGPCSIH DPKAGLEYAR RLVSQAEKHK EDLLIVMRTY
FEKPRTTVGW KGLINDPHLD GSHDIATGLR AARQFLKQVT SLGLPTATEF LEPISPQYMA
DLVSWGAIGA RTTESQIHRQ LASGLSMPIG FKNGTDGDLQ VAVDACSAAA ASQAFLGIDG
DGRAALVATA GNPDTHVILR GGRKGPNYSA ADVEAASAKL AGKQLNPRLI VDASHANSGK
SHHRQAEVAL EIGAQLEDGG AAAAAIAGVM LESFLVGGAQ NLDVAEHAAG TGELVYGQSV
TDACMEWDVT ASVLGQLAAS ARKRRGALEG
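
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an assumption-laden example, not the database's own method. It builds the codon table from the NCBI-style `TCAG` ordering, relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
# Translation table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, or T.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
print(translate("ATGAGCACCG CAGCAGCCAC CGAGACACAG"))  # MSTAAATETQ
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over the whole record.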