Gene Spro_1910 details


Gene Information

Locus tag: Spro_1910
Symbol:
ID: 5606490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 2093484
End bp: 2094509
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 55%
IMG OID: 640937446
Product: aminodeoxychorismate lyase
Protein accession: YP_001478141
Protein GI: 157370152
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000953369
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000738863
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAAGAAAA GAAAGCTGAA GTTCGTTTCT ATTATTGTTG TTCTGGTATT GGGCCTGCTG 
TTTTGGGGCT ACCAGAAGGT TGAACGCTTC GCGGATACGC CACTGGCGAT CCAGCAGGAA
GCCATTTTCA AACTGCCGGC AGGGACCGGT CGGGTAGCCC TGGAGGGGCT GCTGGTGCGG
GACAAACTGA TCCGCAATGG CCAGTGGTTC CCTTGGTTGC TGCGCCTGGA GCCGGAATTG
GCCGAGTTTA AGGCTGGAAC CTATCGCTTT ACGCCGGGTA TGACGGTGCG TCAAATGCTT
AAACTGTTGG CCAGCGGTAA AGAAGCCCAA TTCAGCGCAC GCTTTATTGA AGGTTCCCGC
CTGCGGGACT GGCTGCTGGT GCTGCAACAG TCAAAATACC TCAAACATAC CCTGGCCGGT
AAAAGCGAGG CGGAAATTGC CAAGGCGCTA GGCTTGCCAG AAGGCGCCAA CCCAGAAGGG
CGCCTGTACC CGGATACCTA TCTGTATACC GCAGGCATGA GCGATATGGC GCTGTTGAAG
CGTGCCCACC TGCGTATGAT TAAAGCATTG GAGAGCGCCT GGCAGGGCCG TGAGGCCAGT
TTGCCGTACA AAACGCCGGA AGAGTTGCTG ACCATGGCCT CAATCATTGA GAAAGAGACT
GCGGTACCGG AGGAACGTAC CAAAGTGGCC TCGGTATTCA TTAATCGCCT GCGTATTGGC
ATGCGTTTGC AGACCGACCC GACGGTGATC TACGGCATGG GCGAGGCGTA TAATGGCAAC
ATTACCCGCA AGGATTTGGA AACGCCGACG CCGTACAACA CCTACGTGAT CAACGGTCTG
CCGCCAACGC CGATTGCCAT GCCAAGCCAG GCTTCGCTGG AGGCCGCTGC CAATCCGGCC
AAGACGCCTT ATTTGTACTT TGTTGCCGAC GGTAAGGGCG GGCATCAATT TACCACCAAC
CTGGCCAGCC ATAATCAGGC GGTGCGTGCC TATCGTCAGG CGTTAAAGGA AAAGAATGAA
AAGTAA
 
Protein sequence
MKKRKLKFVS IIVVLVLGLL FWGYQKVERF ADTPLAIQQE AIFKLPAGTG RVALEGLLVR 
DKLIRNGQWF PWLLRLEPEL AEFKAGTYRF TPGMTVRQML KLLASGKEAQ FSARFIEGSR
LRDWLLVLQQ SKYLKHTLAG KSEAEIAKAL GLPEGANPEG RLYPDTYLYT AGMSDMALLK
RAHLRMIKAL ESAWQGREAS LPYKTPEELL TMASIIEKET AVPEERTKVA SVFINRLRIG
MRLQTDPTVI YGMGEAYNGN ITRKDLETPT PYNTYVINGL PPTPIAMPSQ ASLEAAANPA
KTPYLYFVAD GKGGHQFTTN LASHNQAVRA YRQALKEKNE K
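
The gene length (1026 bp) and protein length (341 aa) in this record are consistent with each other: 1026 / 3 = 342 codons, one of which is the terminal stop codon. The following is a minimal Python sketch of how the block-formatted sequence text could be cleaned and checked against those figures. The function names are illustrative and not part of any database tooling, and only the first line of the gene sequence is pasted in, so the GC fraction is computed by a helper rather than quoted as a result.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group a sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in a complete CDS, minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# First line of the gene sequence from this record (six blocks of ten bases).
first_line = "ATGAAGAAAA GAAAGCTGAA GTTCGTTTCT ATTATTGTTG TTCTGGTATT GGGCCTGCTG"
dna = clean_sequence(first_line)

print(len(dna))                       # 60 bases once the spacing is removed
print(expected_protein_length(1026))  # 341, matching the Protein Length field
```

To check the reported GC content, the full gene sequence can be passed through `clean_sequence` and then `gc_fraction`.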