Gene Daci_4721 details


Gene Information

Locus tag: Daci_4721
Symbol:
ID: 5750313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Delftia acidovorans SPH-1
Kingdom: Bacteria
Replicon accession: NC_010002
Strand:
Start bp: 5174028
End bp: 5175128
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 68%
IMG OID: 641299826
Product: chorismate mutase
Protein accession: YP_001565735
Protein GI: 160900153
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0077] Prephenate dehydratase
        [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2
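
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5174028
end_bp = 5175128

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the reported protein length omits the terminal stop codon.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1101 0 366
```

Under these assumptions the result agrees with the reported gene length (1101 bp) and protein length (366 aa).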


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.997566
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCA CACCCCAAGC CTCTCCCGAT CTGGCGCATC TGCGCGTGCA GATCGACGAT 
ATCGACCAGC AACTGCTGGA TCTGCTGAAC CGACGTGCCC GCGTGGCAGA GCAGGTCGGC
GAGGTCAAGA AGCGTGAAGG CACGCCCTTC TTCCGCCCGG ACCGCGTGGC CCAGGTCATC
CAGAAGATCG AGTCCGCCAA TCCCGGCCCG CTCAAGAATG GCCATGTCTC GGCCATCTGG
CGCGAGATCA TGTCGGCCTG CCTGGCGCTG GAGTCGCCCC AGCGCGTGGC CGTGCTGGGC
CCGGCGGGCA CGTTCTGCGA GGAAGCCGCC ATCCAGTACT TCGGCGGCGC GGCCGATCTG
ATGTACTGCA ACAGCTTCGA CGAGGTGTTC CACGCCACGG CCGCAGGCAG CGCGCAGTAC
GGCGTGGTGG GCGTGGAGAA CTCCAACGAA GGCGTGGTCA CGCGCTCGCT GGACATGTTC
CTGCACACGC CCTGCCACGT GGTGGGCGAG GTCAGCCTGC TGGTGCGCCA CAACCTGCTT
CGCAGCAGCA ACACGACCGA GGGCATCGAG GTCGTGGCAG CCCATCCCCA GGCCCTGGCA
CAGTGCCAGG GCTGGCTGGC CAAGCACCTG CCGCATGCCG AGCGCCGCCC GGTGTCCAGC
AATGCCGAAG GCGCCCGCCT GGCGGCGCTG CACCCCAACA TCGCCGGCCT GGCCAGCGAA
CGCGCGGCCC AGCAATTCGG CCTGCATGTG GTGGCGCATG CCATCCAGGA CGATGCCTAC
AACCGCACGC GCTTCGCCGT CATCTGCCTG CCGCACACGC TGGCCACGCC CTCGCCCAGC
GGCCAGGATT GCACCAGCAT CATCATCTCC GTGCCCAACC GCCCCGGTGC CGTGCATGAC
CTGCTGGTGC CGCTGAAGAA GCACGGCGTG TCGATGACGC GCTTCGAGTC GCGCCCCGCG
CGCACCGGCC AGTGGGAGTA CTACTTCTAC ATCGACCTCG AAGGCCACCC GGCACAGCCC
AACGTGGCCA GCGCGCTGGA AGAGTTGCGC GGCCTGTGCG CCTTCTACAA GGTGCTGGGC
ACCTACCCGG TATCCAAGTG A
 
Protein sequence
MSTTPQASPD LAHLRVQIDD IDQQLLDLLN RRARVAEQVG EVKKREGTPF FRPDRVAQVI 
QKIESANPGP LKNGHVSAIW REIMSACLAL ESPQRVAVLG PAGTFCEEAA IQYFGGAADL
MYCNSFDEVF HATAAGSAQY GVVGVENSNE GVVTRSLDMF LHTPCHVVGE VSLLVRHNLL
RSSNTTEGIE VVAAHPQALA QCQGWLAKHL PHAERRPVSS NAEGARLAAL HPNIAGLASE
RAAQQFGLHV VAHAIQDDAY NRTRFAVICL PHTLATPSPS GQDCTSIIIS VPNRPGAVHD
LLVPLKKHGV SMTRFESRPA RTGQWEYYFY IDLEGHPAQP NVASALEELR GLCAFYKVLG
TYPVSK
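
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block could be cleaned, translated, and checked for GC content follows. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, not part of any database tool. The codon table built here is the standard amino-acid assignment, which translation table 11 shares for sense codons; alternative start codons are not handled.

```python
# Build the standard codon table (same amino-acid assignments as table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; stop codons appear as '*'."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    return sum(base in "GC" for base in dna) / len(dna)


# First ten codons of the gene sequence, copied from the record.
first_row = clean("ATGAGCACCA CACCCCAAGC CTCTCCCGAT")
print(translate(first_row))  # prints: MSTTPQASPD
```

The translated fragment matches the first ten residues of the protein sequence listed in the record. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the reported GC content.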