Gene Spro_4201 details

Gene Information

Locus tag: Spro_4201
Symbol:
ID: 5602783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 4656838
End bp: 4658289
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 61%
IMG OID: 640939761
Product: amidohydrolase
Protein accession: YP_001480423
Protein GI: 157372434
COG category: [R] General function prediction only
COG ID: [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase
TIGRFAM ID: [TIGR01891] amidohydrolase
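
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4656838
end_bp = 4658289
gene_length_bp = 1452
protein_length_aa = 483

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
# Assumption: the last codon is a stop codon and contributes no amino acid.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

All three checks hold for this record: 4658289 - 4656838 + 1 = 1452, and 1452 / 3 - 1 = 483.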


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.888012
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAACC CGCAACTGCT TACGTTTATC AATCAGTACA TTGACCAACA CCAGCCGCAG 
TTCAGCGCAC TGAGCGACAG CATCTGGGAT CACCCGGAGA CGCGCTTTAC CGAAACCTAT
TCTGCCAATC TGCTGGCCGA CGCGCTGGAG CAGGAAGGTT TTCGGCTTGA GCGCGGCGTT
GGCGGCATTG AAACCGCGTT TATCGCCAGC TACGGCAGCG GCCAGCCGGT GATCGCGCTG
CTGGGCGAGT ACGACGCTCT CGCCGGACTG AGCCAACAGG CCGGCTGTGC CACGCCGCAG
CCGCTGGTGG AAAACGGCAA CGGCCACGGC TGCGGCCACA ATCTGCTGGG CACCGCCGCG
CTGGCGGGGG CATTCGCGGT GAAGGCCTGG ATGCAACAGC AGCATCTGTC CGGCACCGTG
CGTTTCTACG GCTGCCCGGG TGAAGAAGGC GGCTCCGGCA AAACCTTTAT GGTGCGTGAA
GGCCTGTTCG ATGACGTTGA TGCCGCCCTC ACCTGGCATC CGGAAGGCTT CAGCGGCATG
TTCAATACCA GCACTTTGGC CAACATTCAG GCGGCGTTTC AGTTCAAGGG CATCGCTGCT
CACGCCGCCA ACTCACCGCA CCTTGGCCGC AGCGCGCTGG ATGCGGTAAC GCTGATGAAC
ACCGGTGCCA ACTTCCTGCG TGAGCACATC GTACAGGAAG CCCGGCTGCA CTATGCCGTC
ACCAATACCG GTGGCAGTTC ACCCAATGTG GTGCAAGCCG ATGCCGAGGT GCTGTACCTG
GTCCGCGCGC CACAGCTCGA TCAGGCGCAG GATATTTACC AACGGGTGAT CAACATCGCC
AAAGGCGCAG CGCTGATGAC CGATACCCAG ATGACGGTGC GTTTCGACAA GGCTTGCTCC
AACTACGTGC CAAACCGCAG CATGGAGCAG GTGATGTATC GCTATGTCTG CGACTTCGGC
CTGCCGGAAT ACAGTGAGGC GGAACGCGAA TTCGCCGGCG AAATCCGCCA AACGCTCAAC
AAAGATGACC TGCGTAATGC CAGGTTAAAT ATCGCCCGTA CCGGCGGCGC GGCGGGCCGC
GAGTGGGTTC AGAATTTGGG CGACAAGGTG TTGATGGATC AGGTAGCTCC TTATGTGGCA
TCGGAAGATC TACTGTACGG CTCTACCGAC GTCGGCGATG TCAGTTGGGT CGCGCCGACC
GCCCAGTGCT TCAGCCCCTG CTTTGCGTTC GGCACTCCGC TGCACACCTG GCAACTGGTG
GCACAGGGTC GCACTTCGAT CGCCCACAAA GGCATGTGCC TGGCCGGCAA GGTGATGTCG
GCCACCGCCG TCGAACTGCT GAGCGACAGC GCCCTGCTGG CGGACTGCCG CCGCGAGTTC
GAAGGCCAGC GCGCCGAACA GCCTTATAGC TGCCCGATCC CTAAAGACAT CAGGCCTTCC
CCGTTAAAGT AA
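
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases among all A, C, G, and T bases; the function name `gc_fraction` is hypothetical, and the database may compute or round its value differently.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C among the A/C/G/T characters in `sequence`.

    Whitespace and any other characters (such as the spaces that separate
    the 10-base blocks in the listing above) are ignored.
    """
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G, or T bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Usage on the first two 10-base blocks of the gene sequence only.
first_blocks = "ATGAGTAACC CGCAACTGCT"
print(gc_fraction(first_blocks))  # 10 of 20 bases are G or C -> 0.5
```

Pasting the full gene sequence into the function gives the value to compare against the listed GC content.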
 
Protein sequence
MSNPQLLTFI NQYIDQHQPQ FSALSDSIWD HPETRFTETY SANLLADALE QEGFRLERGV 
GGIETAFIAS YGSGQPVIAL LGEYDALAGL SQQAGCATPQ PLVENGNGHG CGHNLLGTAA
LAGAFAVKAW MQQQHLSGTV RFYGCPGEEG GSGKTFMVRE GLFDDVDAAL TWHPEGFSGM
FNTSTLANIQ AAFQFKGIAA HAANSPHLGR SALDAVTLMN TGANFLREHI VQEARLHYAV
TNTGGSSPNV VQADAEVLYL VRAPQLDQAQ DIYQRVINIA KGAALMTDTQ MTVRFDKACS
NYVPNRSMEQ VMYRYVCDFG LPEYSEAERE FAGEIRQTLN KDDLRNARLN IARTGGAAGR
EWVQNLGDKV LMDQVAPYVA SEDLLYGSTD VGDVSWVAPT AQCFSPCFAF GTPLHTWQLV
AQGRTSIAHK GMCLAGKVMS ATAVELLSDS ALLADCRREF EGQRAEQPYS CPIPKDIRPS
PLK
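
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, assuming the gene sequence is given 5' to 3' on the coding strand. Translation table 11 assigns the same amino acid to each codon as the standard code and differs in its permitted start codons, which this sketch does not special-case. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional T, C, A, G order.
# Table 11 uses the same assignments as the standard code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence until the first stop codon or the end."""
    # Drop the spaces and line breaks used to lay out the sequence listing.
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence above.
print(translate("ATGAGTAACC CGCAACTGCT TACGTTTATC"))  # MSNPQLLTFI
print(translate("CCGTTAAAGT AA"))                     # PLK
```

The two outputs match the first ten and last three residues of the protein sequence listed above.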