Gene Achl_2009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_2009 
SymbolaroB 
ID7293470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp2265615 
End bp2266706 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content66% 
IMG OID643590413 
Product3-dehydroquinate synthase 
Protein accessionYP_002488072 
Protein GI220912763 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.000000160945 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAACACCT CATCAACAGT CATCAAGGTC ACCGGTGAAT CCGCCGCCAA CAACTACGAC 
GTTGTGGTGG GGCGGGGCCT GCTGGGAACC CTTCCGGAAA TCCTGGGGGA GCGGGTGCGG
CGCGTGCTGG TCATCCACCC CCGGGCCCTG CGCCTCACCG GTGACACCGT CCGCGATGAC
CTTGAGTCCG CGGGCTTCAC TGCGCTGACC GCGGAAATCC CGGACGCCGA AGAAGGCAAG
CACATCCAGG TCGCTGCCTT CTGCTGGCAG GTCCTGGGCC AGAACGACTT CACCAGGTCT
GACGCCATCG TGGCTGTCGG CGGGGGAGCG GTCACCGACC TGGCGGGCTT CGTGGCCGCC
ACCTGGCTCC GCGGCGTCAA GGTCATCCAC ATGCCCACCA GCCTGCTGGG GATGGTGGAT
GCGTCCGTGG GCGGCAAGAC CGGCATCAAC ACCGCCGAGG GCAAGAACCT GGTGGGCGCC
TTCCACCCTC CGGCGGCGGT CCTGGCGGAC CTGGACACCC TGGACACGCT GCCCCGGAAC
GAACTCATTT CCGGTATGGC CGAAGTGGTC AAGTGCGGCT TCATCGCGGA CCCTGCCATC
CTCGAACTGG TGGAGAAGGA CTTTGCTGCG GTCACCGATC CGCGGTCCGA GACCCTCCGC
GAGCTCATTG AACGTGCCAT CGCCGTCAAA GCCAAAGTGG TTTCGGAAGA CCTCAAGGAA
TCCGGGCTGC GCGAAATCCT CAACTACGGC CACACCCTGG GCCACGCCAT CGAACTGGTG
GAACGCTACT CGTGGCGGCA CGGCGCCGCA GTCTCCGTCG GCATGATGTT CGCCGCGGAA
CTCGCCCGCA GCGTGGGCCG GCTGAGCGAT GCCGACGCCG ACAGGCACCG AAGCATCCTC
GAAGGACTTG GGCTCCCGGT CACCTACCGG CGGGACCGAT GGCAGGGCCT GCTGGACGGC
ATGCGGCGGG ACAAGAAGTC CCGCGGCGAC CTGCTGCGGT TCGTGGTGCT GGACGGTGTG
GCCAAACCGG GCATCCTGGA TGTCCCCGAC ACGTCCCTCC TGTTCGCCGC CTACCAGGAA
GTCGCTTCCT GA
 
Protein sequence
MNTSSTVIKV TGESAANNYD VVVGRGLLGT LPEILGERVR RVLVIHPRAL RLTGDTVRDD 
LESAGFTALT AEIPDAEEGK HIQVAAFCWQ VLGQNDFTRS DAIVAVGGGA VTDLAGFVAA
TWLRGVKVIH MPTSLLGMVD ASVGGKTGIN TAEGKNLVGA FHPPAAVLAD LDTLDTLPRN
ELISGMAEVV KCGFIADPAI LELVEKDFAA VTDPRSETLR ELIERAIAVK AKVVSEDLKE
SGLREILNYG HTLGHAIELV ERYSWRHGAA VSVGMMFAAE LARSVGRLSD ADADRHRSIL
EGLGLPVTYR RDRWQGLLDG MRRDKKSRGD LLRFVVLDGV AKPGILDVPD TSLLFAAYQE
VAS