Gene Spro_1937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_1937 
Symbol 
ID5606876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp2121440 
End bp2123473 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content56% 
IMG OID640937475 
Productoligopeptidase B 
Protein accessionYP_001478168 
Protein GI157370179 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.158885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0155167 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACAC CCCCTAAGGC AGAAAAACGA CCTTATCCCA TTACTACCCA CGGCGACACG 
CGGGTAGATG ACTATTATTG GCTGCGCGAC GACGAGCGCG CAGACCCGCA GGTGCTGGAC
TATCTGCAGG CGGAAAACGC CTTTACCGAC GCGGTATTGA AACCGCAACA GGCGCTGCGC
GAAACTCTGT ATGAAGAGAT GGTGGCGCGT ATTCCTCAGC AGGAACATTC GGTGCCTTAC
GTCCGACACG GCTACCGCTA TCAGACCCGC TTTGAACCGG GCAATGAATA CGCGATTTAC
GTCCGCCAAC CGCAGGCCGA GAGCGAGCAT TGGGATATCC TGATCGACGG CAATCAACGT
GCCGAGCAGC GTGAGTTTTA TACCCTCGGC GGGCTGGAAG TCAGTCCCGA TAACCAAAAA
CTGGCGGTTG CGGAAGATTT TCTCTCTCGT CGCCAGTACG ACATTCGTTT CAAAAACCTC
AGCGACGATA GCTGGACCGA CGAAGTGCTG GAAAACACCT CCGGCAGCTT TGAATGGGCC
AACGACTCGG CCACCGTGTA TTACGTACGC AAACACGCCA AAACGTTGCT GCCTTATCAG
GTCTATCGCC ACGTGGTGGG CACCGATCCG CAGCTCGATG AGCTGATCTA CGAAGAAAAG
GATGACACCT TCTACGTCGG GTTGGAAAAG ACCACCTCCG ATCGTTTTAT TCTGATCCAT
CTGAGCAGCA CCACCACCTC GGAAATCCTG CTGCTGGACG CCGATCGCGC CGACAGCAAG
CCGCAGATGT TTGTGCCGCG TCGTAAGGAT CATGAATACG GTATCGATCA CTATCACCAG
CATTTCTATA TCCGTTCCAA CAAGGACGGC AAGAACTTCG GCCTGTATCA AAGTGATCAG
GCGGATGAAG CGCAGTGGCA GACGCTGATC GCTCCGCGCA TCGAAGTGAT GCTGGAGGGC
TTTAGCCTGT TCCGTGACTG GCTGGTGGTG GAGGAGCGCA GCGAAGGTCT GACCCAACTG
CGACAGATCC ACTGGCAGAG CGGCGAAGTG AAGCGCATTG CCTTCGACGA CCCCACCTAC
ACCACCTGGC TGGCGTACAA CCCGGAACCG GAAACCGAGC TACTGCGCTA TGGCTATTCT
TCGATGACCA CGCCGACCAC GCTGTATGAG CTTAATCTCG ACAGCGGTGA GCGCGTGATG
CTCAAACAGC AGGAAGTGAA AAACTTCACT CCGGAAAATT ACCGTAGCGA GCGGGTATGG
GTGAAGGCGC GTGATGGCGT TGAAGTCCCG GTTTCGCTGG TCTACCGCCA CGATAAATTT
ACGCGCGGCA CCAACCCGCT GATGGTGTAT GGCTACGGCT CTTACGGCAG CAGCATGGAT
CCGGCCTTCA GCGCCAGCCG CTTAAGCCTG TTGGATCGCG GTTTTGTGTT TGTGCTGGCG
CACATTCGCG GCGGCGGTGA GCTGGGGCAA CTGTGGTATG AAGACGGCAA ACTGTTCAAA
AAGCAAAACA CCTTCAACGA TTTCATCGAC GTGACCGAGG CGTTGATCGC CCAGGGTTAC
GGTGACGCCA AACGGGTATT TGCCATGGGT GGCAGTGCCG GTGGCTTGCT GATGGGCGCG
GTGATTAACC AGGCACCCAG GTTGTTCAAC GGCATTGTGG CGCAGGTGCC TTTTGTTGAT
GTGGTCACCA CCATGCTCGA CGAGTCAATT CCGCTGACGA CCGGCGAGTA CGACGAATGG
GGCAACCCGA ACGAGCAGGC CTACTACGAC TACATTTTGC AATACAGCCC TTACGATCAG
GTCAAGGCGC AGGATTATCC GCATATGCTG GTCACCACCG GTTTGCATGA CTCTCAGGTA
CAGTATTGGG AGCCGGCCAA GTGGGTGGCC AAGCTGCGTG AGTTGAAGAC CGACGATCGC
CAGCTGTTGC TGTATACCGA TATGGATTCC GGCCACGGCG GCAAGTCCGG GCGTTTCAAA
GCTTATGAAG ATATCGCGCT GGAGTACGCC TTTATTCTGG CGCTGGCGGA GTAA
 
Protein sequence
MMTPPKAEKR PYPITTHGDT RVDDYYWLRD DERADPQVLD YLQAENAFTD AVLKPQQALR 
ETLYEEMVAR IPQQEHSVPY VRHGYRYQTR FEPGNEYAIY VRQPQAESEH WDILIDGNQR
AEQREFYTLG GLEVSPDNQK LAVAEDFLSR RQYDIRFKNL SDDSWTDEVL ENTSGSFEWA
NDSATVYYVR KHAKTLLPYQ VYRHVVGTDP QLDELIYEEK DDTFYVGLEK TTSDRFILIH
LSSTTTSEIL LLDADRADSK PQMFVPRRKD HEYGIDHYHQ HFYIRSNKDG KNFGLYQSDQ
ADEAQWQTLI APRIEVMLEG FSLFRDWLVV EERSEGLTQL RQIHWQSGEV KRIAFDDPTY
TTWLAYNPEP ETELLRYGYS SMTTPTTLYE LNLDSGERVM LKQQEVKNFT PENYRSERVW
VKARDGVEVP VSLVYRHDKF TRGTNPLMVY GYGSYGSSMD PAFSASRLSL LDRGFVFVLA
HIRGGGELGQ LWYEDGKLFK KQNTFNDFID VTEALIAQGY GDAKRVFAMG GSAGGLLMGA
VINQAPRLFN GIVAQVPFVD VVTTMLDESI PLTTGEYDEW GNPNEQAYYD YILQYSPYDQ
VKAQDYPHML VTTGLHDSQV QYWEPAKWVA KLRELKTDDR QLLLYTDMDS GHGGKSGRFK
AYEDIALEYA FILALAE