Gene Spro_3031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_3031 
Symbol 
ID5604103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp3335216 
End bp3336433 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content57% 
IMG OID640938572 
Productpeptidase M24 
Protein accessionYP_001479260 
Protein GI157371271 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACAG GCATTGGCGG TTGCACGGCG CAACAGGCCC TGGAACAGTT GCAACCGTTG 
ACGGACAATC CGCCTGAAAT CGCCGGCGAT GAGTATCGGC AACGCATTCA GCATGCGCAG
CGCCTGATGC GCGAAAACGG TATCGACGCC TGCTATGTCA ACGCCGGCAG TAATCTGCGC
TATTTTACCG GTACTCAATG GTCGGCGAGT GAACGCATGG TCGGTGCGGT GATCCCGACC
GAAGGCGAAA TCGCCTATAT TGCCCCCTGG TTTGAAACTG GCACCTTCAA AGACGCACAG
GTGATAGAAG CAGAAATTTT TAGCTGGCAT GAGGAAGAAG ATCCCTACCG GTTGTTCTTC
ACCGTACTGG CCTCTCGCGG GCTAACGGGC GCACGCCAGG TGGCAATTTG CGAAACGGCT
TCGGTCACCC TGTTCCTCGG CCTGCAACAA TATGCCGGCG ATATCAGGCT GATCAGTGCC
CAGCCGATCA CCGGCCATTG CCGCAGCCGT AAATCGGCGA CGGAAATCGT ACTGATGCAA
ACCGCCAATA ATATTACTCT GCGGGTGCAA CAGGCTGCGG CCAGCATATT GCGCCCTGGC
ATCACCGCCA GCGAACTGAT CGACTTCGTC GATAAAGCCC ACCGTAAAAT GGGCACCAGC
GGCTCGTACT TCTGTATTGC ACTGTTCGGA TCCGATAGCG CGTTCCCCCA TGGCGTGAAA
CAACCCAACC CGCTGCAAAA CAACGATATC GTGCTGCTCG ATACCGGCTG CCGCTATAAG
GGTTACCTGT CCGATATCAC CCGTACCTAT GTGTACGGCG AAGCCAACGA ACGGCAGCGT
TTCGCCTGGC AGGCGGAGCA TGAGGCGCAG GCCGCCGCTT TTGCCGTTAT CGCGCCCGGT
GTGCCCTGCC ATAAAGTGGA TGACGCGGCA CGCGATGTGC TGGTTTCTTA TGGATTTGGG
CCGGACTATC AGCTGCCGGG CCTGCCGCAT CGCACCGGGC ACGGTATCGG GCTGGATATT
CATGAAGCCC CGTATCTGAT CCGCAAACAG CAGCAACCGC TCGATGTCGG CATGTGCGCC
AGCATCGAAC CCATGCTGTG CCTGCCGGGT GAGTTTGGCA TCCGCCTTGA AGATCATTTT
TACGTCACCC ACGAAGGTGC ACGCTGGTTT ACCCCGCCGG CAAAATCCAT TGATAACCCA
TTTGATTTAG CGGGCTGA
 
Protein sequence
MATGIGGCTA QQALEQLQPL TDNPPEIAGD EYRQRIQHAQ RLMRENGIDA CYVNAGSNLR 
YFTGTQWSAS ERMVGAVIPT EGEIAYIAPW FETGTFKDAQ VIEAEIFSWH EEEDPYRLFF
TVLASRGLTG ARQVAICETA SVTLFLGLQQ YAGDIRLISA QPITGHCRSR KSATEIVLMQ
TANNITLRVQ QAAASILRPG ITASELIDFV DKAHRKMGTS GSYFCIALFG SDSAFPHGVK
QPNPLQNNDI VLLDTGCRYK GYLSDITRTY VYGEANERQR FAWQAEHEAQ AAAFAVIAPG
VPCHKVDDAA RDVLVSYGFG PDYQLPGLPH RTGHGIGLDI HEAPYLIRKQ QQPLDVGMCA
SIEPMLCLPG EFGIRLEDHF YVTHEGARWF TPPAKSIDNP FDLAG