Gene Spro_4624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4624 
Symbol 
ID5603959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp5100396 
End bp5101430 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content58% 
IMG OID640940190 
Productcupin 2 domain-containing protein 
Protein accessionYP_001480845 
Protein GI157372856 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3435] Gentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR02272] gentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCG ATATTCAGCA ACTGCAAAAG AAACATCTGT TCCCGCTATG GGAGTCACTG 
CATGGCCTGG TCCCCAATCA GCCAGAACCT CAGGCTACGC CCTATCAATG GCATTACGCT
GAGGTCAAAG ACGCGCTGCT GGATATCGGC TCAAAGATCG ACATCGAACG TGCCGAACGC
CGCGTGCTGG TCATGGAAAA TCCGTCGCTG CCACCGGGCT CGTCACGCAT CACCGACACG
CTGTATGCCG GAATGCAAAT GGTGCTGCCG GGTGAAATTG CCCCATTGCA TCGCCACACC
CCCACCGCGC TGCGCTTTAT TCTGGAAGCC GAAGGCGGCA ACACCACGGT GGACGGCGAA
AAGACTACGC TGCACCCGGG GGATTTTATC ATTACCCCTT CATGGCGTTG GCACCAGCAC
CAAAATGACA CCGATAAACC GATATTCTGG CTCGATGGCC TGGATGCCCC GCTGCTGCAC
TTTCTCAAGG CCGGGTTTCG GCAAGATCGC CTGCCCGCAG GCCAAACGTT GGAGCCACGT
CCTGAGGGCG ATGCTCTGGC CCGTTACGGT ACCAGCCTGA TACCGCTGGA GTACCGCCCA
ACGGGGCAGA GCCAGCCCTC GCCGATATTT AACTATCCGT ATCAACGCAC CCGCGAAGCG
CTGGAACAAT TAAAGCGCCA TAGCGAGATC GACCCGGCCA GCGCCATCAG CCTGCGTTAT
ATCAATCCGG CCAACGGCGA TTGGGCCATT CCCACCCTCG GCACCGTCAT CACGCTGCTG
CCAAGAGGCT TTACCAGCCT CTACCAACGT GGCAATGCCA GCCAGGTGCT GGTGGTGATG
GAGGGAGAAC TGGAAGTCCG GTTAACCGGC GGGATTCACT TCCGGCTAAA ACCGAAAGAC
ATTTTTGCGC TGCCTTCGTG GCTCAGCTAT CAGCTGTCGG CACCTGTCGG CGATACGGTC
TGCTTCAGCT TTTCCGATCG CCCGGTGCTG GAAAAACTGG GGATCTGGCG GCACGAGATC
ACGCCTGAGG GCTAA
 
Protein sequence
MSIDIQQLQK KHLFPLWESL HGLVPNQPEP QATPYQWHYA EVKDALLDIG SKIDIERAER 
RVLVMENPSL PPGSSRITDT LYAGMQMVLP GEIAPLHRHT PTALRFILEA EGGNTTVDGE
KTTLHPGDFI ITPSWRWHQH QNDTDKPIFW LDGLDAPLLH FLKAGFRQDR LPAGQTLEPR
PEGDALARYG TSLIPLEYRP TGQSQPSPIF NYPYQRTREA LEQLKRHSEI DPASAISLRY
INPANGDWAI PTLGTVITLL PRGFTSLYQR GNASQVLVVM EGELEVRLTG GIHFRLKPKD
IFALPSWLSY QLSAPVGDTV CFSFSDRPVL EKLGIWRHEI TPEG