Gene Spro_3103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_3103 
Symbol 
ID5604553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp3414186 
End bp3415493 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content52% 
IMG OID640938643 
Productglycoside hydrolase family protein 
Protein accessionYP_001479331 
Protein GI157371342 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.199701 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAAG TAACTTTTAT GGGCGCAGGC AGCACCATCT TCGCCAAGAA CGTACTCGGC 
GATATCATGG CAACGCCGGC ATTGAAAGAG GTGGATATCG CGCTGTACGA CATTGATAGC
GCCCGTCTCA ATGAATCTTT CGCCATGCTC AGCAATATCA ATCGCAATAT TAACGCTGGC
AGGGCGAAGA TCACTCCCTA CCTCGGGGTG GAAAATCGTC GTTCGGCGTT GAAAAATGCC
AATTATGTGG TTAACGCCAT CCAGGTTGGC GGTTACGATC CTTGCACTAT CACCGATTTC
ACCATTGCTA AAAAGTATGG CCTGCAGCAA ACCATTGCCG ATACCCTGGG GATAGGTGGC
ATCTTCCGTG CGCTGCGCAC CATTCCGGTG ATGTTTGACT TCGCCCGGGA TATTGAAGCG
GTATGTCCGG ATGCCTGGTT GCTGAACTAC ACCAACCCGA TGGCGGCCTT AACCGGCGCC
ATGCTGCGCC ATACCGAAGT GAAAACGGTG GGGTTGTGTC ACAGTGTGCA GGTCTGTGCA
GAGACGCTGC TGAAAAGCGT GGATATGCCT ACCGATGATG TCCAGTTCCA CATCGCAGGC
ATTAACCATA TGGCCTGGTT GCTGGACGTT CGTCGCCATG GCGAGGATCT GTACCCGGAA
ATCAAGCGTC GCGCCAATGC GCTGCAGGGC AAACATGATG ATATGGTGCG CCATGAAATC
ATGAAAACCT TTGGCTATTA CGTTACCGAG TCTTCGGAAC ATAACGCCGA GTACATGCCT
TATTGGATCA AGCGTAACTA TCCTGAATTG ATTGAGCGCT TTAACATTCC GCTGGACGAG
TACCCGCGCC GCTGTGTTGA GCAGATTGAA CAATGGCAAC AGCGCAAGCT GGCGCTGACT
AATGACGCCA ACCTGACTCA TACCCGCACT CATGAGTATG CGTCTTATAT TATTGAAGCG
ATGGAAACCG ATCGCCCGTA CAAGATTGGT GGCAATGTGC TCAACAGCGG TTTAATTACC
AACCTGCCTG CCGAGGCCTG TGTTGAAGTG CCTTGCCTGG TGGATGGGCA GGGCATCTCG
CCCTGTTACG TCGGTCATTT ACCGGAGCAA CTGGCGGCGC TCAACCGCAC CAATATCAAT
ACCCAACTTC TGACTATCGA AGCCGCAGTA ACCCATAAAC GCGAAGCGAT TTACCACGCG
GCACTGCTGG ATCCGCATAC ATCTGCCGAG CTTTCGATTG ATGATATCCG TAAACTCTGC
GATGAACTGA TTGAGGCCCA CGGTAACTGG CTTCCCGCCT ACCACTGA
 
Protein sequence
MIKVTFMGAG STIFAKNVLG DIMATPALKE VDIALYDIDS ARLNESFAML SNINRNINAG 
RAKITPYLGV ENRRSALKNA NYVVNAIQVG GYDPCTITDF TIAKKYGLQQ TIADTLGIGG
IFRALRTIPV MFDFARDIEA VCPDAWLLNY TNPMAALTGA MLRHTEVKTV GLCHSVQVCA
ETLLKSVDMP TDDVQFHIAG INHMAWLLDV RRHGEDLYPE IKRRANALQG KHDDMVRHEI
MKTFGYYVTE SSEHNAEYMP YWIKRNYPEL IERFNIPLDE YPRRCVEQIE QWQQRKLALT
NDANLTHTRT HEYASYIIEA METDRPYKIG GNVLNSGLIT NLPAEACVEV PCLVDGQGIS
PCYVGHLPEQ LAALNRTNIN TQLLTIEAAV THKREAIYHA ALLDPHTSAE LSIDDIRKLC
DELIEAHGNW LPAYH