Gene Dtpsy_2046 details

Gene Information

Locus tag: Dtpsy_2046
Symbol:
ID: 7385039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax ebreus TPSY
Kingdom: Bacteria
Replicon accession: NC_011992
Strand:
Start bp: 2190328
End bp: 2191428
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 67%
IMG OID: 643655365
Product: chorismate synthase
Protein accession: YP_002553502
Protein GI: 222111238
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
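
The coordinates and lengths in this record can be cross-checked against each other. The following is a minimal Python sketch, assuming 1-based inclusive coordinates (consistent with the values above, since 2191428 - 2190328 + 1 = 1101) and assuming the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2190328, 2191428)
print(length)                  # 1101
print(protein_length(length))  # 366
```

Both results agree with the Gene Length and Protein Length fields reported for this gene.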


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.226148
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGGCA ACACACTCGG TACCCTTTTT TGCGTCACCA ACTTTGGTGA ATCCCATGGC 
CCCGCCATCG GCTGCGTGAT CGACGGCTGC CCGCCCGGCA TGGAACTGTC CGAGGCCGAC
ATTCAGGCCG ACCTGGACCG CCGCCGCCCC GGCACCAGCC GCCATGTGAC GCAGCGCAAC
GAGCCGGATG CGGTGGAAAT CCTCTCGGGC GTGTATGAGG GCAAGACCAC CGGCACGCCC
ATCGCGCTGC TGATCCGCAA CACCGACCAG CGCAGCAAGG ACTACGGCAA CATCGCGCAG
AGCTTCCGCC CGGGTCATGC CGACTATGCC TACTGGCACA AGTACGGCCT GCGCGACCCG
CGCGGAGGCG GACGCTCGTC TGCGCGCCTC ACGGCACCCA CCGTGGCTGC CGGCGCGGTG
GCCAAGAAAT GGCTGGCCGA GAAATACGGC ACCCGCTTTC GTGCCTGCAT GACCCAGCTG
GGCGAACTGC CCATCCCGTT CGAGAATTGG GAGCATGTGC CGCACAACCC CTTCTTCGCA
CCGGTGGCCG ACGTGCAGGC CTACGAGGAC TACATGGACG CGCTGCGCAA GTCCGGCGAC
TCCTGCGGCG CGCGCATTCG TGTGCAGGCC ACCAGCGTGC CCGTGGGGCT GGGCGAGCCG
CTGTACGACA AGCTGGACGC CGACATCGCC CATGTGATGA TGGGCCTGAA CGCGGTGAAG
GGTGTGGAGA TTGGGGCCGG CTTTGCCAGC GTGGCCCAGC GCGGTACCAC GCATGGCGAT
TCGCTCACGC CCACGGGCTT CGCCAGCAAC AACGCGGGTG GCGTGCTGGG CGGCATCAGC
ACGGGGCAGG ACATCGAGGT TTCGCTGGCC ATCAAGCCCA CCAGTTCCAT CATCAGCCCG
CGCGAGTCCA TCGACATCCA CGGCCAGAGT ACCGAGGTGA TCACGAAGGG GCGCCACGAC
CCCTGCGTGG GCATCCGCGC CGCGCCGATC GCCGAGGCGT TGCTCGCATT GGTCATCATG
GACCATGCGC TGCGCCACCG TGCGCAATGC GGCGACGTGG TGCAGGCCGT GGCTCCGATT
CCGGCAGTCC GCCTGGGGTG A
 
Protein sequence
MSGNTLGTLF CVTNFGESHG PAIGCVIDGC PPGMELSEAD IQADLDRRRP GTSRHVTQRN 
EPDAVEILSG VYEGKTTGTP IALLIRNTDQ RSKDYGNIAQ SFRPGHADYA YWHKYGLRDP
RGGGRSSARL TAPTVAAGAV AKKWLAEKYG TRFRACMTQL GELPIPFENW EHVPHNPFFA
PVADVQAYED YMDALRKSGD SCGARIRVQA TSVPVGLGEP LYDKLDADIA HVMMGLNAVK
GVEIGAGFAS VAQRGTTHGD SLTPTGFASN NAGGVLGGIS TGQDIEVSLA IKPTSSIISP
RESIDIHGQS TEVITKGRHD PCVGIRAAPI AEALLALVIM DHALRHRAQC GDVVQAVAPI
PAVRLG
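
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a hedged sketch in Python, not the tool that produced this record: it assumes translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG), and it assumes the sequence contains only A, C, G, and T. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence shown above.
first_row = "ATGAGCGGCA ACACACTCGG TACCCTTTTT TGCGTCACCA ACTTTGGTGA ATCCCATGGC"
print(translate(first_row))  # MSGNTLGTLFCVTNFGESHG
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison with the protein sequence and GC content fields listed in this record.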