Gene Spro_3157 details


Gene Information

Locus tag: Spro_3157
Symbol:
ID: 5605267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 3475817
End bp: 3477442
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 59%
IMG OID: 640938700
Product: extracellular solute-binding protein
Protein accession: YP_001479385
Protein GI: 157371396
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
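
The start, end, gene length, and protein length fields above are arithmetically related. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, which is consistent with the listed values, and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3475817, 3477442)
print(length, protein_length(length))  # 1626 541
```

Both results match the Gene Length and Protein Length fields of this record.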


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.821676
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTAATT TCTCTTTCAA TCGATTAACC AGCGGGATAA CAGTGGCATT GGGACTGGCG 
GCGGCGATGA ATCCGGCGCT GGCGTCGGTG CAGGGCGGCA CGCTTGTCTA CCTGGAACAG
CAGGCTCACA CCAACCTTTA TCCGCCTTCC GGCGGTTTCT ACCCCAACGG CGGTATCCTC
AATCAAATCA CCGACAAACT GACCTACCAG AACCCGCAAA CGCTGGAGAT TGAGCCGTGG
ATTGCCGAGT CCTGGAGCAG CAACGCCGAT AAAACCGAAT ACACCTTCAA GCTGCGGCCT
GGCGTGACTT TCTCCGACGG TACACCGCTG GATGCCAATG CAGTGGCGAA AAACTTCGAT
ACCTATGGCC TGGGCAATAA AGAGAAACGT CTGCCGGTGT CCGAAGTCAT CAATAACTAC
GACCACAGCG AAGTGATCGA CCCGTTAACC GTCAAGTTTT ACTTCAAGCA CTCGTCTCCC
GGCTTCCTGC AGGGCACCGC CACCATTGGT TCTGGCCTGG TTTCGCTCAG CACCCTGAAC
CGCAGCTATG ACCAACTCGG TGATGCGCGC CATATCATCG GTTCCGGCCC GTTTGTGGTC
AGCGCCGAGA CGCTGGGGCG TGAGGTCAGC CTGAGCGTTC GCAAGGATTA TCACTGGGGT
CCGGCCAAAT TGGCTCAGCA AGGGCGCGCT AATCTGGATG GCATCAAGGT GATTGTCACC
GGCGAAGACA GCGTGCGTAT CGGTGCGTTG CAGGCTGGGC AGGCGGACTT TATTCGCCAG
ATCCAGGCCT ACGACGAGAA GCAGACGCAG GAACAGGGCT TTACGATTTA CGCGGCCCCC
ACTCGTGGTG TCAACGACAG CGTCGCCTTC CGGCCGGATA ACCCGCTGGT GAGCGACCTG
CGCGTGCGTC AGGCACTGCT GCACGCCACC GACAGCAAGC AAATTGTCGA TACGCTGTTC
TCGGTTAACT ACCCACAGGC TAAATCGGTG ATTGCTTCTT CCGCCGCCGG TTTCGTCGAC
TTATCCGCCA AGCTGAAATT CGACCCGGAG CTGGCCAACC GTCTGCTGGA TGAGGCGGGC
TGGAAAAAGG GCGGCGACGG CCTGCGTGAG AAGGACGGCA AAAAATTGCT GCTGAATGTC
TATGAATCGC TGCCGCAGCC GCAGAACAAG GCGGTGCTGC AGCTGGTTTC GCAGCAGTGG
GGCAAGGTCG GCGCGCGCTT GAACATTCTG GCGGGCGACG CCGGCAGCAA GGTGGCGGAT
AACCTCGATC CGCAGAAAAC CCCGGCGGCG GTGGTGGAGG TGGGACGGGC GGATCCGGAC
GTGATTAAGA GCCAGTTCTA CCCGACCAAC CGCGATGCGC TGTTGCAGCA GGGCGGGACG
GGCAAAAACA GCGCATTCAA AGATGACAAG CTGAACGCGC TGCTGCTGGG CATCGCCTCT
GAGGTGGACC CGAAAAAACG CCTGCAGATT GCCGGTGAGG CGCAGAATTA CCTGCTTGAT
CAGGCCTATG TGATCCCGTT CTTCGAGGAG CCGCAGGTGT TTGCCGGTGC ACCTTATCTG
AAGGGGGTTA GTTTCGAAGC GGTCGGTCGC CCGAGTTTCT ACGGCGCCTG GTTAGAGAAA
CACTGA
 
Protein sequence
MFNFSFNRLT SGITVALGLA AAMNPALASV QGGTLVYLEQ QAHTNLYPPS GGFYPNGGIL 
NQITDKLTYQ NPQTLEIEPW IAESWSSNAD KTEYTFKLRP GVTFSDGTPL DANAVAKNFD
TYGLGNKEKR LPVSEVINNY DHSEVIDPLT VKFYFKHSSP GFLQGTATIG SGLVSLSTLN
RSYDQLGDAR HIIGSGPFVV SAETLGREVS LSVRKDYHWG PAKLAQQGRA NLDGIKVIVT
GEDSVRIGAL QAGQADFIRQ IQAYDEKQTQ EQGFTIYAAP TRGVNDSVAF RPDNPLVSDL
RVRQALLHAT DSKQIVDTLF SVNYPQAKSV IASSAAGFVD LSAKLKFDPE LANRLLDEAG
WKKGGDGLRE KDGKKLLLNV YESLPQPQNK AVLQLVSQQW GKVGARLNIL AGDAGSKVAD
NLDPQKTPAA VVEVGRADPD VIKSQFYPTN RDALLQQGGT GKNSAFKDDK LNALLLGIAS
EVDPKKRLQI AGEAQNYLLD QAYVIPFFEE PQVFAGAPYL KGVSFEAVGR PSFYGAWLEK
H
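
The protein sequence above is the conceptual translation of the gene sequence. A minimal sketch of that translation follows. It strips the 10-base grouping and decodes codons with the standard amino acid assignments, which translation table 11 shares with table 1. Table 11 differs mainly in its permitted start codons, which does not matter for this gene because it begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and chosen for illustration only.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence in this record.
first_line = "ATGTTTAATT TCTCTTTCAA TCGATTAACC AGCGGGATAA CAGTGGCATT GGGACTGGCG"
print(translate(first_line))  # MFNFSFNRLTSGITVALGLA
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.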