Gene Spro_4221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4221 
Symbol 
ID5603175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4680442 
End bp4681845 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content57% 
IMG OID640939781 
ProductBeta-glucosidase 
Protein accessionYP_001480443 
Protein GI157372454 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0294315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTAT TTCCGAAGGA TTTCCTGTGG GGCGCGGCGA CCGCGTCTTA CCAGGTTGAG 
GGCGGCTTTG ATGCCGACGG CAAGGGCCTG TCCAACTGGG ACTTGTTCTC CCACCTGCCC
GGCACCACTT ATCAGGGTAC CAACGGCGAC GTCGCGGTCG ATCACTACCA TCGCTTTCGC
GAAGACGTAG CGCTGATGGC CGAATTGGGG ATGCAGACCT ACCGATTTTC GATCTCGTGG
CCACGGTTGC TGCCGCAGGG GCGGGGCGAG GTGAATGAGG CCGGGATCCA ATTCTACAGC
GATCTGATCG ACGAACTGTT GAAGCACAAC ATCAAACCGA TGATCACCCT GTACCACTGG
GATCTGCCGC AGGCGCTGCA AGAAGAATTT GGCGGTTGGG AATCGCGTGA GATCGTCGAT
GCTTTCGATG AATATGCCCG TCTGTGTTAT CAGCGTTTCG GCGACCGCGT CGAGCTGTGG
TCTACCTTTA ACGAAACCAT CGTGTTTATC GGCATGGGCT ATATCACCGG AGCGCATCCG
CCCAAGTTGA CCGATCCGAA GAAGGGCATT CAGGCCTGTC ACCATGTGTT CCTGGCCAAT
GCCCGCGCGG TAAAAAGCTT CCGCGAAATG AAGATCAACG GTCAGATCGG CTTCGTCAAC
GTGCTGCAAC CTAACGATCC GATCAGCGAC TCGCCAGAAG ATCGCCGCGC CTGCGAGTTA
GCCGAGGGGA TCTTCACCCA CTGGCTGTAC GATCCGGTGT TGAAGGGCGA ATACCCGGCA
GAGCTGTTGG CGATGGCGCA GCAGGCCTTT GGCGTACCTT ATTTTGCACC GGGCGATGAG
GCGTTGCTGA AGGGCAACAT CGTCGATTTT ATCGGTCTTA ATTACTACAA GCGCGAAATG
GTGGCACATA ACGACGATGT CGAGGGCTAC GCGATCAATA CCAGTGGCCA GAAGGGCAGC
GGGCGTGAAC TGGGCTTTAA GGGGCTGTTC AAACTGGTGC GCAACCCGAA CGGGGTTTAT
ACCGACTGGG ACTGGGAGGT TTATCCGCAG GGGCTGACCG ATGCCATTGG CCGCATCGTC
AAACGCTATG GCAACATTCC GATCTACATT ACCGAGAACG GGTTGGGTGC CAAGGATCCG
ATCGTCGAGG GGGAAGTGCG CGATCAACCG CGCATAGACT ATCTGCGCGA TCATATTCAG
GCGATCGGTG CGGCGATCGA GCAGGGTGCC GATGTGCGCG GTTACTACCC CTGGTCGTTT
ATCGATCTGC TTTCCTGGCT CAACGGCTAT CAGAAGCAGT ACGGCTTTGT GTATGTCGAT
CACGACAACA ATCTGGCGCG CAAGAAGAAG CAGAGTTTTG GCTGGTATCA GCGGGTGATC
GCCAGCCACG GTGAGCAGCT GTAA
 
Protein sequence
MSVFPKDFLW GAATASYQVE GGFDADGKGL SNWDLFSHLP GTTYQGTNGD VAVDHYHRFR 
EDVALMAELG MQTYRFSISW PRLLPQGRGE VNEAGIQFYS DLIDELLKHN IKPMITLYHW
DLPQALQEEF GGWESREIVD AFDEYARLCY QRFGDRVELW STFNETIVFI GMGYITGAHP
PKLTDPKKGI QACHHVFLAN ARAVKSFREM KINGQIGFVN VLQPNDPISD SPEDRRACEL
AEGIFTHWLY DPVLKGEYPA ELLAMAQQAF GVPYFAPGDE ALLKGNIVDF IGLNYYKREM
VAHNDDVEGY AINTSGQKGS GRELGFKGLF KLVRNPNGVY TDWDWEVYPQ GLTDAIGRIV
KRYGNIPIYI TENGLGAKDP IVEGEVRDQP RIDYLRDHIQ AIGAAIEQGA DVRGYYPWSF
IDLLSWLNGY QKQYGFVYVD HDNNLARKKK QSFGWYQRVI ASHGEQL