Gene Spro_2119 details


Gene Information

Locus tag: Spro_2119
Symbol:
ID: 5606137
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 2313038
End bp: 2315074
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 57%
IMG OID: 640937655
Product: carboxy-terminal protease
Protein accession: YP_001478348
Protein GI: 157370359
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
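
The coordinate and length fields above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state the convention, but the numbers agree under this reading) and that the final codon of the CDS is a stop codon. The function and constant names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 2313038
END_BP = 2315074
GENE_LENGTH_BP = 2037
PROTEIN_LENGTH_AA = 678


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Amino acids encoded by a CDS whose last codon is a stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 2037
print(coding_length_aa(GENE_LENGTH_BP))  # 678
```

Both results match the reported Gene Length and Protein Length.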


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000159277
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000525254
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACAAAT TTGTCAGATT AACAGCAGTC GCGGGTCTGT TGTGGGCGGG TGTCAGTTAC 
GGAGCGGAAC CAGCCAACAT CCGCATCGAT CAACTGCCTC AGCTGCAGCA GGAACCGCAA
CATGCAACTG TGAGTGAGCG CGTAACTTCG CGCTTCACTC GCTCTCATTA CCGTCAGTTT
TCCCTCGACG CGGACTTTTC AGGCAAGATC TTCGATCGTT ATCTGAATAT GCTGGACTAC
AGCCATAACG TGCTGCTGGC CTCCGACGTG GCGCAATTCG CCAACAAGCG CAATCAGCTG
GGCGAAGAAC TGAAAAGCGG TAAGCTCGAT ACGCCATACG CGCTGTACAA TCTGGCGCAG
AAACGCCGTT TTGAGCGTTA CACCTATGCA TTGTCGCTGC TGGAAAAGCC AATGAGCTTC
ACCGGCAACG ACACTATTGA TCTCGACCGC AGCAAAGCGC CGTGGCCGAA AGACAAGGCC
GAACTGGACG CGCTGTGGAA TGCGAAAGTC AAATATGACG AGCTGAACCT CAAGCTGACC
GGCAAGACCG ACAAGGAAAT TCGTGAAACG CTGACCAAGC GCTATCAGTT TGCCATCAAG
CGCCTGACGC AAAGCAACAG CGAAGACGTT TTCCAACTGG CGATGAATGC CTTTGCGCAT
GAAATCGACC CGCATACCAA CTATCTCTCC CCACGCAATA CCGAACAGTT CAATACCGAG
ATGAGCCTGT CGCTGGAAGG TATCGGTGCG GTGTTGCAGA TGGATGACGA TTACACCCTG
ATCAACTCCA TGGTGCCAGG TGGCCCGGCG GCGAAGAGCA AGGCGATCAC CGTGGGTGAC
CGTATTGTCG GCGTTGGCCA GGCGGGCAAG CCTGTGGTCG ATGTGATCGG CTGGCGTCTG
GACGACGTGG TTTCCCTGAT TAAAGGGCCG AAGGGCAGCA AGGTGCGCCT GGAGATCCTG
CCGGCCGGCA AGGGCACTAA AACCCGAGTG GTCACCTTGA CCCGTGAGCG TATCCGTCTG
GAAGACCGCG CGGTGAAAAT GACCATCAAG ACCGTCGGCA AAGAGAAAGT CGCGGTGATG
GACATTCCGG GCTTCTACGT GGGCCTGACC GATGACGTGA AAGTTCAGTT GCAGAAGATG
GCCAAGCAGA ACGTCAAGAG CCTGATCATC GACCTGCGCA CTAACGGCGG CGGCGCACTG
ACCGAAGCGG TTTCGCTGTC CGGTCTGTTC ATTCCGAGCG GCCCGGTAGT GCAGGTACGT
GACAACAACG GTAAAGTGCG TGAAGACGCG GACACCGACG GCGTGACCTA TTACAAGGGG
CCGCTGGTGG TACTGGTTGA CCGTTTCAGC GCCTCGGCTT CGGAGATCTT CGCCGCGGCA
ATGCAGGACT ATGGTCGCGC GCTGATCGTC GGTGAACCGA CCTTCGGGAA AGGCACCGTG
CAGCAGTATC GCTCGCTGAA CCGCATTTAC GATCAGATGC TGCGTCCGGA GTGGCCGGCG
TTGGGGTCGG TGCAATACAC CATACAGAAG TTCTACCGCG TTAACGGCGG CAGTACCCAA
CGTAAGGGGG TTACCCCGGA TATCCTGATG CCGAGCGGCA TTGATCCGGC GGAAACCGGT
GAAGCGTTTG AAGATAACGC TATGCCGTGG GACAGCATCA ATGCGGCGAC CTACACCAAA
ACCGGTGACA TGAAGCCGTT TGAGCCTGAA CTGCTGAAGG ATCATGAGCA GCGTATCGCC
AAGGATCCCG AGTTCCAGTA CATCGCGCAG GATATCGCTC ATTACAAGGC GCTGAAGGAC
AAGCGTAACA TCGTCTCTCT CAACCTGGTT CAGCGCGAGA AAGAGAACCA CGATGATGAC
GCTACCCGTC TGCAACGTGT TAATGATCGC CTGCAGCGCG CCGGTAAAAA GCCGCTGAAG
GCCCTGGAAG ATTTGCCGAA GGATTACCAG GAACCTGACC CATATCTGGA TGAAACCGTG
CACATCGCAC TGGATCTGGC GCACCTTGAT CAGGCGCAGC CGGCGGCGGC GAAATAA
 
Protein sequence
MNKFVRLTAV AGLLWAGVSY GAEPANIRID QLPQLQQEPQ HATVSERVTS RFTRSHYRQF 
SLDADFSGKI FDRYLNMLDY SHNVLLASDV AQFANKRNQL GEELKSGKLD TPYALYNLAQ
KRRFERYTYA LSLLEKPMSF TGNDTIDLDR SKAPWPKDKA ELDALWNAKV KYDELNLKLT
GKTDKEIRET LTKRYQFAIK RLTQSNSEDV FQLAMNAFAH EIDPHTNYLS PRNTEQFNTE
MSLSLEGIGA VLQMDDDYTL INSMVPGGPA AKSKAITVGD RIVGVGQAGK PVVDVIGWRL
DDVVSLIKGP KGSKVRLEIL PAGKGTKTRV VTLTRERIRL EDRAVKMTIK TVGKEKVAVM
DIPGFYVGLT DDVKVQLQKM AKQNVKSLII DLRTNGGGAL TEAVSLSGLF IPSGPVVQVR
DNNGKVREDA DTDGVTYYKG PLVVLVDRFS ASASEIFAAA MQDYGRALIV GEPTFGKGTV
QQYRSLNRIY DQMLRPEWPA LGSVQYTIQK FYRVNGGSTQ RKGVTPDILM PSGIDPAETG
EAFEDNAMPW DSINAATYTK TGDMKPFEPE LLKDHEQRIA KDPEFQYIAQ DIAHYKALKD
KRNIVSLNLV QREKENHDDD ATRLQRVNDR LQRAGKKPLK ALEDLPKDYQ EPDPYLDETV
HIALDLAHLD QAQPAAAK
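
Both sequences are displayed in space-separated blocks of ten letters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, with a helper for the GC fraction. The function names `join_blocks` and `gc_fraction` are illustrative assumptions, not part of the source database, and only the first and last displayed lines of the gene sequence are used here.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last displayed lines of the gene sequence, copied from the record.
first_line = "ATGAACAAAT TTGTCAGATT AACAGCAGTC GCGGGTCTGT TGTGGGCGGG TGTCAGTTAC"
last_line = "CACATCGCAC TGGATCTGGC GCACCTTGAT CAGGCGCAGC CGGCGGCGGC GAAATAA"

head = join_blocks(first_line)
tail = join_blocks(last_line)
print(len(head), head[:3])  # 60 ATG
print(tail[-3:])            # TAA
```

The gene sequence opens with the start codon ATG and closes with the stop codon TAA. Applying `gc_fraction` to the full joined gene sequence would be expected to give a value near the reported 57%, although that figure is not recomputed here.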