Gene Spro_2571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_2571 
Symbol 
ID5604498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp2827337 
End bp2828590 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content52% 
IMG OID640938110 
Productextracellular solute-binding protein 
Protein accessionYP_001478800 
Protein GI157370811 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCGA TAAAAAATAC CGGCCGTCTT CTGGCGCACT CTGCCATTGC GGTTGCATTA 
TTATTGCCGA CCATGTTATA TGCCGCCGAT CAAATAACCC TGCGTTACGC CGTGTGGGAC
AGGAATCAAT TACCTTCCGA GCAGGAGATT GCCAAAAGAT TTGAAAAAGA AAACCCTAAC
ATTAAAATCG CCATAGAATT AACGCCGTCG GCACAATACT TCGTTAAATT AGATTCCGCC
GCCGCAGGGG GTGTCGCGCC AGATATATTC TGGATAAACA TGCCTTATTT CGTTCAATAC
GCCAAAAATG GCATTATGCA ACCCTTAACG CCGTATATTA CCGGCAGCAG CGTGCAATTA
AATAATATCG TCGCCAGTTC GGTTAAAGCC TATCAGTATG ATGGGCAGCA AATGGCCATC
CCCCGCGACG TTGACTCAAT CGCCGTGTGG TACAACAAAA AATTATTCGA CCAGGCCGGC
GTCAGCTACC CGACCAACGA CTGGAGCTGG GATGACCTGA AAAACAAAGC CACCGCGCTG
AAAACCGGCT TGAAGGGCAG TGCCTTCCCG CTGGTGATGG ATCTGAGCAT TGACGGGCAG
GACAGCTATA TGAACCTGCT GTTCCAAAAC GGCAACCACA TAGTCCCGAA AGACGGTCAA
CCCACCGACA TCGCTAACGA TAAATCCATC TGGGTCTATC AACAGCTGCA ATCCATGATG
AAAGATGGCT TGATGCCCAG CGCCCAACAG ATGAGCGAAG TCAAAACCGA AAACATCTTC
CAGTCCAACC GTGCGGCGAT GGTGTATGCC GGCTCATGGC TGGCCGCCCC GTTCGCCAAC
AATCCGCTGA TCAACGACCA TATCGGCGTG GTCATGATGC CGAAAATCGA GCGCCAGTCC
GGCGTGGCGC ACAGCCTGGC ATTTGCCATG TCCGCCAACA GCGCCCACAA GCAGGAAGCC
TGGAAATACA TCGAATTTAT GAGCTCCGAA GCCTCGCAGA CCGAGTTGGC AAAAGCGGTG
ATCCCGGCCA ACAAACTGGC GGCCAAAGCC TGGGCGGCGG AGATCAAAAA AGTCGATGTC
ACACCTTATA TTGACACCCT CAACGTGACC GAAGCCTACC CCACAGCCGG TACCAATACG
CCGAAATGGC AAAACATGTG GATTGCCAGC CTAAAGAAAA TCTTTATGGG TGCGGACGCC
AAAGTCGAGA TGGACAAATC GGTCAAGAAG ATCGAACGCG TAATGGAGCA GTAA
 
Protein sequence
MSPIKNTGRL LAHSAIAVAL LLPTMLYAAD QITLRYAVWD RNQLPSEQEI AKRFEKENPN 
IKIAIELTPS AQYFVKLDSA AAGGVAPDIF WINMPYFVQY AKNGIMQPLT PYITGSSVQL
NNIVASSVKA YQYDGQQMAI PRDVDSIAVW YNKKLFDQAG VSYPTNDWSW DDLKNKATAL
KTGLKGSAFP LVMDLSIDGQ DSYMNLLFQN GNHIVPKDGQ PTDIANDKSI WVYQQLQSMM
KDGLMPSAQQ MSEVKTENIF QSNRAAMVYA GSWLAAPFAN NPLINDHIGV VMMPKIERQS
GVAHSLAFAM SANSAHKQEA WKYIEFMSSE ASQTELAKAV IPANKLAAKA WAAEIKKVDV
TPYIDTLNVT EAYPTAGTNT PKWQNMWIAS LKKIFMGADA KVEMDKSVKK IERVMEQ