Gene Spro_1103 details


Gene Information

Locus tag: Spro_1103
Symbol:
ID: 5605473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 1213688
End bp: 1215415
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 59%
IMG OID: 640936622
Product: extracellular solute-binding protein
Protein accession: YP_001477335
Protein GI: 157369346
COG category: [R] General function prediction only
COG ID: [COG4533] ABC-type uncharacterized transport system, periplasmic component
TIGRFAM ID:
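
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1213688
end_bp = 1215415

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the result agrees with the listed gene length (1728 bp) and protein length (575 aa).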


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000365076
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGCCTGA CCCACCGCCT CAGCCAGTAT CAACGCCTGT ACCAGCAGCT CGGCGGCACG 
CCGGTCGCCA TTACCGTTGG CGAACTGGCA GCGATGTTCT TTTGCAGCGA ACGCCATGCC
CGCACGCTGG TGCAACAACT GCAGGATCAG GGCTGGCTGA GCTGGCAGTC ACAGCCCGGG
CGCGGCAAAC GTGCGCACCT GCACTGCCTG AAAACGCCGG ATGAACTACG TGCAGCCCAT
CTGCAAAGGT TGCTGAAGGA AGGCGATCAT CAGGGAGCGC TGGAAATGGC GCAGTTGGAT
CCGCAACATT TGCAGGAGCT GCTCAGCCCA CATCTTGGGG GGCAATGGCA AGCCGGCAGC
CCAACGTTGC GTATCCCCTA CTACCGTACG CTCGAATCGC TGGATCCGCT GACCCTGACC
GGGCGCGCCG AACAGCATTT GGTCGCCACC CTGCATGCCG GCCTGACTCG CTTTAATACC
GGTAACTCCG AGCCGCAATC AGATCTGGCC CATCATTGGC AAATCAGCCA TGACGGCCTG
CGCTGGCAGT TCTTTCTACG CAGCCAATTG CGCTGGCACA ACGGTGAGCC GCTGACCGGC
CAGCAACTGC TGCAAACGCT GGAAAAACTG CAGCACCATC CTCGCAGTCA ACCGAGCCTG
GCCAACGTTG CCGCTATCAG CCTTCCCCAT GCGCTGTGCC TGCAGTTTGA TTTACAGTGT
CCGGACTACT GGTTGGCGCA CCGGCTGGCG GAGTTGCCTT GTCTGCTGAC CCATCCGGAG
GCTTCGGGCA TGGGAGCCGG CCCGTTTAAG CTGACGCTGA ACCAACCGCA CCTGGTCCGG
CTGGAGCAAC ACCCGTTTTA CCATTTGCAA CATCCTTTTC TTGAGAGCAT CGAGTATTGG
ATCACACCCG AGTTGAACAC CGGTGAGGCT TATACCAGTT GCCAGCATCC GGTACGCATT
ACCCTCGGTC AGCAGGAAGA GATAGCATTA GCGCGGCCGG TACAGCGCAG CATGAGTCTG
GGTTGGTGCT ATCTGGCGGT AAATTTGAAG CACGGCGTGC TCAGCGAGGC TCAGGGAAAA
AAACTGCTGA TGCTGATCCA GCAATCCGGC CTGTTGGCCA ATCTGCCGGT GCCCAACAGC
GTCATCACCC CCAGTAGTGA GATGCTGCCC GGCTGGGCAA TTCCACAACA TCAGGCCGCT
GAGGATATTG CCCTGCCCGC TAAATTGACG CTGTTGTATC GGCCCCCGGT GGAACTGGAG
ACGGTCACGG TGGCCTTGCA GCAACTGTTG GCGCAACACG GCTGCGAGCT GGAGTTGCGC
TACTACGCAG GTAAACGCTG GCAAAGCGCC GAACAAATCG AACAGGCCGA CCTGCTGCTG
GCGGATAATC TGATTGGAGA GTCACCGGAG GCCACGCTGG AAAGCTGGCT ACGGCAAGAT
ACGCTGTGGC GCGGTATCCT GCCCGACGCT CGCTGGCAGC ACCAGCAACA AACGCTGCAA
CAGATCCAGC AGCTCCCGGC GCAGCAAACG CGTTACCAGC AATTGAAAGA CTATTATCAG
CAGCTTATGG CGGCGGCGAT CATCACTCCG CTGTTTCACT ATCAGTATCA AATCAGCGCG
CCGCCGCGCA TTCATGGCGT CACCCTGACC GCCCACGGTT GGTTCGATTT CTGTCAGGCC
TGGCTGCCAC CGCCGGTAGA AAATACCACG CCTCCATCAA CCGACTGA
 
Protein sequence
MRLTHRLSQY QRLYQQLGGT PVAITVGELA AMFFCSERHA RTLVQQLQDQ GWLSWQSQPG 
RGKRAHLHCL KTPDELRAAH LQRLLKEGDH QGALEMAQLD PQHLQELLSP HLGGQWQAGS
PTLRIPYYRT LESLDPLTLT GRAEQHLVAT LHAGLTRFNT GNSEPQSDLA HHWQISHDGL
RWQFFLRSQL RWHNGEPLTG QQLLQTLEKL QHHPRSQPSL ANVAAISLPH ALCLQFDLQC
PDYWLAHRLA ELPCLLTHPE ASGMGAGPFK LTLNQPHLVR LEQHPFYHLQ HPFLESIEYW
ITPELNTGEA YTSCQHPVRI TLGQQEEIAL ARPVQRSMSL GWCYLAVNLK HGVLSEAQGK
KLLMLIQQSG LLANLPVPNS VITPSSEMLP GWAIPQHQAA EDIALPAKLT LLYRPPVELE
TVTVALQQLL AQHGCELELR YYAGKRWQSA EQIEQADLLL ADNLIGESPE ATLESWLRQD
TLWRGILPDA RWQHQQQTLQ QIQQLPAQQT RYQQLKDYYQ QLMAAAIITP LFHYQYQISA
PPRIHGVTLT AHGWFDFCQA WLPPPVENTT PPSTD
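
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon table is the standard one; translation table 11 uses the same codon-to-amino-acid assignments and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a sequence."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCGCCTGA CCCACCGCCT CAGCCAGTAT"))  # MRLTHRLSQY
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and GC content to be checked the same way.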