Gene Spro_2347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_2347 
Symbol 
ID5603870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp2558660 
End bp2560210 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content57% 
IMG OID640937886 
Productextracellular solute-binding protein 
Protein accessionYP_001478576 
Protein GI157370587 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.33855 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAGA TCAAATACGC CGCTCTGGCG GCTGCAGTGA CGTTGGGCAT GGCGGTAATG 
GGTCAGGCGC AGGCCGCTGT ACCCAAAGAC ATGCTGGTGA TTGGCAAGGC GGCCGACCCG
CAAACGCTGG ATCCGGCGGT GACCATAGAT AACAACGACT GGACGGTGAC CTATCCGGCC
TACCAACGAC TGGTACAGTA CAAAACCGAG GGCGGCAAAG GCTCGACTCA GGTGGAGGGC
GAGCTGGCAG AAAGCTGGAC GGCTTCTGAC GATCAGCTGG TGTGGACCTT TAAACTCAAA
CCCGGCAACA AGTTTGATGA TGGTTCGGAC GTTAATGCCG AGGCGGTGAA GTGGTCGTTC
GAACGCCTGA TGAAAATTGG TCAGGGGCCT TCCGAAGCTT TCCCGAAAGA CTTGCAGGTG
ACGGCGGTGG ATCCGCTAAC CGTGCGGTTC ACTCTGAAAA CCCCGTTTGC ACCGTTCCTG
TACACCCTGG CCAACGACGG GGCCGGTATT GTTAACCCGG CCATCGCCAA GGCTAATCCG
GCGGACGAAG GTAAAGCCTG GCTGGCGAAC CACAGCGCCG GTTCCGGCCC GTACAAATTG
GATCGTTGGC AAAAAGGCCA GCAGCTGGTG CTGGTACCGA ATCCGCATTA CAGCGGCGCC
AAACCGGCCT TTAAGCGGGT AACGGTGAAA ATCATCGGCG AAAGCGCCAC CCGTCGTTTA
CAACTGACCC GTGGTGACCT GGATATTGCC GAATCCTTGC CGATCGACCA GCTTACGGCG
TTGAAGAGCG AAAACAAGGT GGCGGTCAAT GAGTATCCGT CGCTGCGGGT GACCTACTTA
TATCTCAACA ACGGCAAAGC GCCGCTGAAT CAGGTTGATC TGCGCCGCGC CATTTCCTAC
GCGGTGGATT ACCAGGGCAT GGTGAAGGGC ATTCTCGGCG GTAACGGCAA ACAGATGCGT
GGCCCGATCC CGGAAGGCAT GTGGGGATAC GACGCCACCG CGCAGCAATA TAGCCAGAAT
GCGGACAAGG CCAAAGCAAC ACTGGCGGCG GTGAAGGACA AACCGGCTTC GCTCAATTTC
CTGTATTCGG TAAGCGACCC CAACTGGGAG GCGATAGCGC TGTCGGTACA GGCCAGCCTG
GCGACGGTGG GCATCAACGT CAAGCTGGAG AAACTGGCCA ACGCCACCAT GCGTGACCGT
ATCGGCCAGG GTAACTACGA CATCGCCATC GGCAACTGGA GCCCGGACTT CGCCGACCCC
TACATGTTCA TGAACTACTG GTTCGAATCC GACAAGAAAG GACTGCCGGG CAACCGTTCG
TTCTACAGCG ATCCGCAGGT GGATGCGCTG CTGAAGAAAG CGGTAGCGGT GTCCGATCAG
CAGGTGCGTA CTGCCGATTA CCAGGCGGCG CAGAAAATTG TCATCGACCA GGCGGCTTAT
GTTTATCTGT TCCAGAAAAA CTATCAGGTG GCGATGAACA AAGAGGTGAA GGGCTTTGTA
TATAACCCGA TGCTGGAGCA GGTGTTCAAC GTTGGGCAGA TGAGCAAGTA A
 
Protein sequence
MNQIKYAALA AAVTLGMAVM GQAQAAVPKD MLVIGKAADP QTLDPAVTID NNDWTVTYPA 
YQRLVQYKTE GGKGSTQVEG ELAESWTASD DQLVWTFKLK PGNKFDDGSD VNAEAVKWSF
ERLMKIGQGP SEAFPKDLQV TAVDPLTVRF TLKTPFAPFL YTLANDGAGI VNPAIAKANP
ADEGKAWLAN HSAGSGPYKL DRWQKGQQLV LVPNPHYSGA KPAFKRVTVK IIGESATRRL
QLTRGDLDIA ESLPIDQLTA LKSENKVAVN EYPSLRVTYL YLNNGKAPLN QVDLRRAISY
AVDYQGMVKG ILGGNGKQMR GPIPEGMWGY DATAQQYSQN ADKAKATLAA VKDKPASLNF
LYSVSDPNWE AIALSVQASL ATVGINVKLE KLANATMRDR IGQGNYDIAI GNWSPDFADP
YMFMNYWFES DKKGLPGNRS FYSDPQVDAL LKKAVAVSDQ QVRTADYQAA QKIVIDQAAY
VYLFQKNYQV AMNKEVKGFV YNPMLEQVFN VGQMSK