Gene Spro_4701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4701 
Symbol 
ID5607394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp5195187 
End bp5197229 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content59% 
IMG OID640940267 
Productoligopeptidase A 
Protein accessionYP_001480922 
Protein GI157372933 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAATC CGTTGCTGAC CCCGTTTTCC CTGCCACCGT TTTCCGCCAT TCGCCCAGAA 
GATATCGTGC CTGCGGTGCA ATCCGCATTG GCCGATTGCC GCGCTGCGGT AGAGCGCGTT
GTCGCGCAGC CGGGGCCATT CACCTGGGAT AACCTGTGTC AGCCGCTGGC GGAGTCTGAC
GATCGCCTGT CGCGCATCTG GTCGCCGGTG GGGCATTTGA ACTCGGTAAA AAACAGCCCG
GAACTGCGTG CCGCCTATGA GCAGGCGTTG CCGTTGCTGT CTGAGTACGG CACCTGGGTT
GGGCAACACG AAGGTTTGTA TCAGGCGTAC CGCAGCCTGA AAGAAGGCGA AGCCTTCAAT
CAACTGACCG CACCACAGCG CAAGTCGGTA GAAAATGCGC TGCGTGATTT TGAGCTGTCG
GGCATCGGCC TGTCGCCGGA AAAACAGCGT CGCTATGGTG AAATCGTCGC GCGTCTGTCC
GAGCTGGGTT CTACCTACAG CAACAACGTG CTCGACGCCA CCATGGGTTG GAGCAAGCTG
ATTACCAATG AAACCGAGCT GAGCGGCCTG CCGGAAAGCG CGTTGGCCCA GGCTCAGGCG
ATGGCGCAGG CCAAAGAGCA GGACGGCTGG CTGCTGACGC TGGATATGCC GAGCTATCTG
CCGGTACTGA CCTACGGCGA CAACCGCGCA CTGCGTGAAG AGATGTATCG CGCCTTCGCT
ACCCGCGCTT CCGATCAGGG GCCGAATGCC GGTAAGTGGG ACAACAGCGA AGTGATGGCG
GAAACGCTGG CGCTGCGCCA TGAACTGGCC CAACTGCTGG GCTTTGACAG CTACGCCGAC
AAATCGCTGG CGACCAAAAT GGCGGAAAAC CCGGAGCAGG TGCTCGGTTT CCTCAGCGAT
CTGGCCAAAC GCGCCCGTCC ACAGGCCGAG CAGGAATTGG CGCAGCTGCG CGCCTTCGCC
AAACAGCATT ACGGCGTAGA TGAACTGGAA GCCTGGGACA TTACCTATTA CGGCGAAAAA
CAGAAACAAC ACCTGTTCTC GATCAGCGAC GAGCAACTGC GCCCGTACTT CCCGGAACAG
CGGGTGGTGG AAGGTCTGTT CGAAGTGGTC AAACGCATTT ACGGCATCAC TGCCAAAGAG
CGCAAGGATG TGGATACCTG GCATCCAGAG GTGCGCTTCT TTGATCTGTT CGACGCCAAC
GGCGAGCTGC GCGGCAGCTT CTACCTTGAC CTGTATGCGC GTGAAAACAA ACGCGGCGGG
GCGTGGATGG ACGACTGCGT CGGCAGCCTG CGCAAGGCCA ACGGCGAACT ACAAAAACCG
GTCGCCTATC TGACCTGTAA CTTTAACCGT CCGCTGGGCG ACAAGCCGGC GCTGTTCACC
CATAACGAAG TGACCACCTT GTTCCACGAG TTCGGCCACG GTCTGCACCA TATGCTGACC
CAGATCGACA CCGCCGGCGT TTCCGGCATC AGCGGTGTGC CATGGGATGC GGTCGAACTG
CCAAGCCAGT TTATGGAAAA CTGGTGCTGG GAGCCGGAAG CGCTGGCGTT TATCTCCGGC
CACTATCAGA GCGGTGAACC GCTGCCGAAA GAGATGCTCG ACAAGCTGCT GGCCGCCAAA
AATTACCAGG CGGCGCTGTT TATTCTGCGT CAGTTGGAGT TCGGCCTGTT CGACTTCCGC
ATGCACGCCG AATACAACCC TTCAAGCGGC GCGCAGATCC TGCCAACCTT GGCGGAAGTG
AAGAAAATGG TGGCGGTGGT ACCTTCACCA AGCTGGGGCC GTTTCCCACA TGCTTTCAGC
CATATCTTCG CTGGCGGCTA CGCGGCGGGC TACTACAGCT ACCTGTGGGC GGAAGTGCTG
TCGGCCGATG CCTATTCGCG CTTTGAGGAA GAGGGGATTT TCAACGCCGA AACCGGTAAA
TCCTTCCTCG ACAACATCCT GTCGCGCGGC GGTTCGGAAG AGCCGATGGA ACTGTTCAAA
CGCTTCCGTG GCCGTGAGCC GCAGCTGGAT GCTATGCTGC GCCATTACGG CATTAAAGGA
TAA
 
Protein sequence
MTNPLLTPFS LPPFSAIRPE DIVPAVQSAL ADCRAAVERV VAQPGPFTWD NLCQPLAESD 
DRLSRIWSPV GHLNSVKNSP ELRAAYEQAL PLLSEYGTWV GQHEGLYQAY RSLKEGEAFN
QLTAPQRKSV ENALRDFELS GIGLSPEKQR RYGEIVARLS ELGSTYSNNV LDATMGWSKL
ITNETELSGL PESALAQAQA MAQAKEQDGW LLTLDMPSYL PVLTYGDNRA LREEMYRAFA
TRASDQGPNA GKWDNSEVMA ETLALRHELA QLLGFDSYAD KSLATKMAEN PEQVLGFLSD
LAKRARPQAE QELAQLRAFA KQHYGVDELE AWDITYYGEK QKQHLFSISD EQLRPYFPEQ
RVVEGLFEVV KRIYGITAKE RKDVDTWHPE VRFFDLFDAN GELRGSFYLD LYARENKRGG
AWMDDCVGSL RKANGELQKP VAYLTCNFNR PLGDKPALFT HNEVTTLFHE FGHGLHHMLT
QIDTAGVSGI SGVPWDAVEL PSQFMENWCW EPEALAFISG HYQSGEPLPK EMLDKLLAAK
NYQAALFILR QLEFGLFDFR MHAEYNPSSG AQILPTLAEV KKMVAVVPSP SWGRFPHAFS
HIFAGGYAAG YYSYLWAEVL SADAYSRFEE EGIFNAETGK SFLDNILSRG GSEEPMELFK
RFRGREPQLD AMLRHYGIKG