Gene YpsIP31758_3965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3965 
SymbolaroB 
ID5384485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4447414 
End bp4448502 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content50% 
IMG OID640866996 
Product3-dehydroquinate synthase 
Protein accessionYP_001402913 
Protein GI153949291 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000154886 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAAGA TTACTGTCAC GTTAGGGGAA CGTAGCTACC CCATTACGAT TGCCGCCGGG 
TTGTTTAATG ATCCGGCCTC TTTTAAGCCG CTAAAGGCGG GTGACCAGGT TATGCTGGTC
ACTAACCAAA CGTTGGCTCC GCTCTATCTG GATTCTCTCC GGGCAGTGTT GGAACACGGT
GGCATTAAAG TTGATCAGGT GATTTTACCT GATGGTGAGC AGTATAAATC TCTGAGCGTT
ATGGAGCAGG TTTTTTCTGC CCTTCTGGAA AAACCGCACG GTCGTGATAC TACGTTGGTT
GCCTTAGGTG GCGGCGTAGT GGGCGACCTG ACCGGTTTTG CCGCAGCTTG CTATCAACGC
GGTGTGCGCT TTATTCAAGT TCCTACTACT TTACTTTCTC AAGTGGATTC TTCTGTTGGT
GGTAAAACCG CCGTTAACCA TCCCTTGGGT AAAAATATGA TTGGTGCCTT CTACCAGCCT
GCATCGGTGG TGGTTGATCT TAATTGTCTT AAAACTCTCC CCCCGCGTGA ACTCGCTTCT
GGCTTGGCTG AAGTGATCAA ATACGGCATC ATCCTTGATG CAGCTTTCTT CGATTGGTTA
GAAAACAACA TTGACGCTTT ATTAGCGCTG GATATGTCAG CATTAGCTTA CTGTATTCGC
CGCTGTTGCG AATTAAAGGC CGATGTCGTT GCTGCTGATG AACGCGAAGA GAGCGGTGCG
CGCGCTTTAC TCAATTTGGG TCACACCTAT GGTCATGCTA TCGAAGCTGA AATGGGCTAC
GGAGTGTGGT TACACGGTGA AGCGGTTGCT GCTGGGATGG TGATGGCCGC ACAGACATCC
CGTCGTCTGG GGCAACTCTC TGTCAGTGAT GTTGAGCGTA TCAAGAAACT CCTATTACGT
GCTGGTCTAC CCGTTTGTGG GCCTAAAGAA ATGGCACCAG AATCTTATCT GCCACACATG
ATGCGGGATA AAAAAGTATT GGCGGGTGAA CTTCGTCTGG TACTGCCAAC GGCCATCGGT
AAATCAGAAA TCCGGGGCGG TGTTGCGCAT GATATGGTGT TGGCATCGAT AGCGGATTGT
CGGCCATAG
 
Protein sequence
MEKITVTLGE RSYPITIAAG LFNDPASFKP LKAGDQVMLV TNQTLAPLYL DSLRAVLEHG 
GIKVDQVILP DGEQYKSLSV MEQVFSALLE KPHGRDTTLV ALGGGVVGDL TGFAAACYQR
GVRFIQVPTT LLSQVDSSVG GKTAVNHPLG KNMIGAFYQP ASVVVDLNCL KTLPPRELAS
GLAEVIKYGI ILDAAFFDWL ENNIDALLAL DMSALAYCIR RCCELKADVV AADEREESGA
RALLNLGHTY GHAIEAEMGY GVWLHGEAVA AGMVMAAQTS RRLGQLSVSD VERIKKLLLR
AGLPVCGPKE MAPESYLPHM MRDKKVLAGE LRLVLPTAIG KSEIRGGVAH DMVLASIADC
RP