Gene Spro_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_1049 
Symbol 
ID5606692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp1154333 
End bp1156156 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content59% 
IMG OID640936568 
Productmaltodextrin glucosidase 
Protein accessionYP_001477281 
Protein GI157369292 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0895639 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAACG CCTGGCATCA ACCGGTCCCG CCCTTCGTGG TGAAGAAGGG GCAACGTCTT 
GATATCACGC TGTGGTTACA GGGTGATGAT GTGCCCGAAC GGGTGTTTTT ACGCGCCGAA
CCGGATAATG AAGAATGGTT GCTGACGATG AAAGTGCAGC ATATTGACGG GCTGCATCGC
TATCAGGCCA GTTTGACGCT CAATGAGGGG GAGCCAACCC GTCGTTACTG CTTCAAGCTG
GTTTGGGCTG CCAGTCAGCA ATGGTTCGGC CCCCAGGGTT GGTCGCTGAC GCCGCCGGGG
CAACTGGCGC AGTTTGCTGT CGACGTGCCC GACCGCAGCC CGGCCTGGGT GGCGGATCAG
GTTTTCTATC AAATTTTCCC CGATCGCTTC TCCAGTAGCG GGGGGGAACA TTGCATTCAA
AGTGGCAGTT ATCGCCATCA TGCGGCCGGT TGCGATGTTG TGCGTCAGGA TTGGCACCAT
CCGTTGGAAG ACCGGCATGC CGCGTCCACC TTCTATGGTG GCGATCTGGA TGGCATCAGC
CAAAAACTGC CTTATCTACA GCAGCTGGGC GTGACGGCGC TGTATCTGAA TCCGATTTTC
ACCGCGCCCA GCGTACATAA ATACGATACC GAAGATTATT ACCAGGTCGA TCCCTATCTC
GGCGGCAACG CCGCCCTGCA GCGTTTGCGA GTGAGTACTC ACAAGGTGGG GATGAAACTG
GTGCTGGACG GGGTGTTTAA CCATACCGGC GATTCGCATC CGTGGTTTGA CCGCCACCAG
CAAGGGGATA ACGGCGCTTG CCATCACCCC GATTCACCTT ACCGTGGCTG GTTCAATTTT
TACCCCGATG GCCGCGCGCT CGACTGGAAA GGGAATGCCA GTCTGCCGAA GCTTAACTTC
GCTGAACCAC AGGTAGCGGA AGCGATTTAT CGCGGTGACG GTAGCGTGGT GCGTCACTGG
TTACGGCCGC CGTACAACAT CGACGGTTGG CGGCTGGATG TGGTGCACAT GCTGGGTGAG
AACGGGGGCG CTACCGGCAA TTTGCAGCAT CTGGCGGGCA TTTATCAGGC GGTTAAACAG
GAAAACCCGC AGGCCTACGT GCTGGGCGAG CATTTTGGCG ACGCTCGTCG CTGGCTGCAT
GCCGGCGTGG AAGATGCGGC AATGAATTAC ATGGGCTTTG CCCTGCCGGT GCGGGGGTTC
CTCGCCGGGC TGGACGTGGC GCACCACCCG GTGCAGCTGG ATGCGGCGGA CTGCGCCCAG
TGGATGGACG GTTATCGGGC CGGGCTGCCG CATGGCCGTC AGTTGATTCA GTTCAACCAG
CTCGACAGCC ATGACACTGC GCGCTTTCTG ACGCTATTAC AGGGCAATCA GGCGCGGATG
CGCATGGCGG CGGTGTGGCT GATGAGCTGG ATAGGCGTAC CCTGCCTATA TTATGGCGAC
GAGATTGGTC TGGATGGGGC CAACGACCCG TTCTGTCGCA AGCCTTTTCC GTGGGATGAA
AGCCAGTGGG ATCAGAACCT GTTGGCCCTG TACCAGCGCA TGGCGGCGCT GCGCAAACAG
AGCCTGGCGC TGCGTCGCGG CGGTTGTCAG GTGCTGCATG CCGCCGCTGA AACGCTGGTA
TTTGTGCGTA TTTACCAACA GGAACAGGTG TTGGTGGCGT TACAGCGTGA CGGCAGCGGC
AGCGTGACCC TGCCACACGG CCCGTTGCTG GCCGCAGGCC AATGGCAACG GCTGGAAGGG
GATGGCGAGT TGAATGACAC GCAAAACGCG ATCTCGCTGC AACTGGGCAA AGAGACGGTA
AGCCTGTGGC GGTTAACAGG CTAA
 
Protein sequence
MLNAWHQPVP PFVVKKGQRL DITLWLQGDD VPERVFLRAE PDNEEWLLTM KVQHIDGLHR 
YQASLTLNEG EPTRRYCFKL VWAASQQWFG PQGWSLTPPG QLAQFAVDVP DRSPAWVADQ
VFYQIFPDRF SSSGGEHCIQ SGSYRHHAAG CDVVRQDWHH PLEDRHAAST FYGGDLDGIS
QKLPYLQQLG VTALYLNPIF TAPSVHKYDT EDYYQVDPYL GGNAALQRLR VSTHKVGMKL
VLDGVFNHTG DSHPWFDRHQ QGDNGACHHP DSPYRGWFNF YPDGRALDWK GNASLPKLNF
AEPQVAEAIY RGDGSVVRHW LRPPYNIDGW RLDVVHMLGE NGGATGNLQH LAGIYQAVKQ
ENPQAYVLGE HFGDARRWLH AGVEDAAMNY MGFALPVRGF LAGLDVAHHP VQLDAADCAQ
WMDGYRAGLP HGRQLIQFNQ LDSHDTARFL TLLQGNQARM RMAAVWLMSW IGVPCLYYGD
EIGLDGANDP FCRKPFPWDE SQWDQNLLAL YQRMAALRKQ SLALRRGGCQ VLHAAAETLV
FVRIYQQEQV LVALQRDGSG SVTLPHGPLL AAGQWQRLEG DGELNDTQNA ISLQLGKETV
SLWRLTG