Gene Spro_4604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4604 
SymbolaroB 
ID5605908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp5079201 
End bp5080301 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content57% 
IMG OID640940170 
Product3-dehydroquinate synthase 
Protein accessionYP_001480825 
Protein GI157372836 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000760296 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGAGAA TTACCGTAAC GCTTGGGGAG CGCAGCTACC CGATAACCAT AGCCGCCGGA 
TTGTTTAACG ATCCGGCTTC TTTTATGCCG CTGAAGGCGG GTGAACAGGT CATGCTGGTC
ACCAACCAAA CCCTGGCGCC ACTCTATCTG GACCACGTCC GGAAGGTGTT GGAGCAGGCA
GGCGTCATGG TGGATCAGGT GATTTTGCCT GATGGCGAAC AGTATAAATC TCTGGCCGTA
CTCGAGCAGG TGTTCTCGGC ACTGTTGGAA AAGCCGCACG GTCGTGATAC CACGCTGATT
GCCCTTGGGG GCGGCGTGAT TGGCGATCTT ACCGGCTTTG CCGCCGCCTG TTATCAGCGC
GGTGTCCGCT TTATTCAGGT CCCTACCACG CTGTTGTCGC AGGTGGACTC TTCCGTTGGC
GGTAAAACCG CCGTCAATCA TCCGCTCGGC AAGAACATGA TCGGCGCCTT CTATCAACCC
GTTTCTGTGG TGGTTGATCT CGATTGCCTG AAAACCTTAC CGGCGCGTGA GCTCTCCTCT
GGTTTGGCTG AAGTGATCAA GTACGGGATT ATTCTCGACC ACGATTTCTT CGTCTGGCTG
GAAAACAATA TCGATGCCCT GGTGGCGCTG GATATGCAGG CTCTGGCCTA CTGTATCCGT
CGCTGCTGCG AGCTGAAAGC TGAGGTGGTT GCTGCTGACG AACGCGAAAG CGGGCTGCGC
GCGCTGCTGA ATCTGGGCCA TACTTACGGC CATGCGATCG AAGCCGAAAT GGGCTATGGT
GTATGGTTGC ACGGTGAGGC CATTGCCGCC GGTATGGTGA TGGCGGCAGA AACCGCGCAC
CGTCTCGGCC AGTTCTCCCG CGAAGATATT GAACGTATTA AAGCACTGTT GTTGCGCGCC
GGTTTACCAG TGTGTGGCCC GCAGGAAATG GCTCCGGGAA CTTATCTGCC GCATATGATG
CGCGATAAGA AAGTCCTGGC CGGTGAATTG CGCCTGGTAC TGCCGACGGC CATTGGCCAG
GCGGAAGTCC GTGGCGGAGT GGGGCATGAT ATGGTGCTCG CTTCGATCGC AGCTTGCTTT
CCTGACGGAA TGTCTAAGTA A
 
Protein sequence
MERITVTLGE RSYPITIAAG LFNDPASFMP LKAGEQVMLV TNQTLAPLYL DHVRKVLEQA 
GVMVDQVILP DGEQYKSLAV LEQVFSALLE KPHGRDTTLI ALGGGVIGDL TGFAAACYQR
GVRFIQVPTT LLSQVDSSVG GKTAVNHPLG KNMIGAFYQP VSVVVDLDCL KTLPARELSS
GLAEVIKYGI ILDHDFFVWL ENNIDALVAL DMQALAYCIR RCCELKAEVV AADERESGLR
ALLNLGHTYG HAIEAEMGYG VWLHGEAIAA GMVMAAETAH RLGQFSREDI ERIKALLLRA
GLPVCGPQEM APGTYLPHMM RDKKVLAGEL RLVLPTAIGQ AEVRGGVGHD MVLASIAACF
PDGMSK