Gene Smed_2564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2564 
SymbolaroB 
ID5323432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2661507 
End bp2662649 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content64% 
IMG OID640791507 
Product3-dehydroquinate synthase 
Protein accessionYP_001328229 
Protein GI150397762 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.443147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCCA TGACCAGCGA TGTGATGCCA GCCGCCGAGC GTAAGGTCCG CGTCGATCTT 
GCGGAGCGTT CCTACGATAT CCTCATCGGC CCCGGCCTGA TCGCCGCCGC GGGCGGGGAG
ATCGCCTCGC GGCTCAAGGG CCGGAAGATG GCGGTCATCA CCGACGAGAA TGTTGCGCCG
CGCTATCTCG AACCGTTGAT GGCGAGCCTT GCGGGAAGCG GCATGGATCC GGTCTCCCTG
ATTCTGCCGG CGGGTGAGAA GACCAAGAGC TTCGAGCACC TGATCCCGGT TTGCGAAGCG
GTCCTGGGTG CCAGAATAGA GCGTAACGAC GCGGTGATCG CGCTCGGTGG CGGCGTGATC
GGCGATCTCA CCGGCTTCGC CGCTGGTATC GTCCGCCGCG GCTCTCGCTT CATCCAGATC
CCGACATCGC TGCTGGCGCA GGTCGATTCC TCCGTCGGCG GCAAGACCGG CATCAATTCG
CCGCACGGCA AAAACCTGAT CGGCGTTTTC CACCAGCCGG ACCTCGTTCT CGCCGACACC
GCCGCGCTCG ACACGCTCAG CCCGCGCGAA TTCCGTGCAG GCTATGCCGA GGTCGTGAAA
TACGGCCTGA TCGACAAGCC GGATTTCTTC GAATGGCTTG AGCAGAACTG GCAGGCCGTG
TTTGCCGGCG GACCCGCCCG GATCGAGGCG ATCGCCGTCA GCTGCCAGGC GAAAGCCGAC
GTCGTCGCCG CGGATGAGCG CGAAAACGGG CTGCGTGCGC TGCTCAATCT CGGCCACACC
TTCGGTCATG CCCTGGAAGC CGCCACCGGA TACGACAGCA AACGGCTGGT GCACGGGGAG
GGCGTTGCGA TCGGTATGGT TCTGGCGCAC GAGTTCTCGG CCCGGATGAA CATTGCGAGT
CCCGACGATG CGCGGCGCGT GGAAATGCAT CTGAAGACGG TCGGCCTCCC GACACGCATG
GCCGACATTC CCGGTATGTT GCCGCCCGCC GATCGACTGC TGGAGGCGAT CGCCCAGGAC
AAGAAGGTAA AGGGAGGCAA GTTCACCTTC ATTCTCACCA GGGGCATCGG ACAGTCCTTC
ATCGCGGATG ACGTGCCCTC CTCGGAGGTC CTAAGCTTTC TTGAGGAAAG GATCCCGCGA
TGA
 
Protein sequence
MKPMTSDVMP AAERKVRVDL AERSYDILIG PGLIAAAGGE IASRLKGRKM AVITDENVAP 
RYLEPLMASL AGSGMDPVSL ILPAGEKTKS FEHLIPVCEA VLGARIERND AVIALGGGVI
GDLTGFAAGI VRRGSRFIQI PTSLLAQVDS SVGGKTGINS PHGKNLIGVF HQPDLVLADT
AALDTLSPRE FRAGYAEVVK YGLIDKPDFF EWLEQNWQAV FAGGPARIEA IAVSCQAKAD
VVAADERENG LRALLNLGHT FGHALEAATG YDSKRLVHGE GVAIGMVLAH EFSARMNIAS
PDDARRVEMH LKTVGLPTRM ADIPGMLPPA DRLLEAIAQD KKVKGGKFTF ILTRGIGQSF
IADDVPSSEV LSFLEERIPR