Gene Spro_1371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_1371 
Symbol 
ID5606048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp1500348 
End bp1502645 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content58% 
IMG OID640936903 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001477603 
Protein GI157369614 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATGGC TTTGTGTGGT GAGTATGTTG TCCGGTCTGG CCATCAGCCC GGTTTTTGCG 
CAGGAAAGCG CCGTGATCCA GGGCGTTAAT GCCCAGCAGC GCGACGCCTT TGTCAGCAAC
CTGATGAAAC AGATGACGCT GGAAGAAAAA ATCGGCCAGC TGCGGTTAAT CAGCGTCGGA
CCGGATAATC CGAAAGAGGC GATCCGCGAC GGCATCAGCA AGGGGCAGAT TGGCGCCATC
TTCAATACCG TCACCCGGCC GGATATTCGC GCCATGCAGG ATCAGGCGAT GCAGCTCAGC
CGCCTGAAAA TTCCGTTGTT CTTTGCCTAC GACGTGGTGC ACGGCCAACG CACCGTGTTC
CCGATCAGCC TTGGGCTGGC GGCCAGTTTT GATCTCGAAG CCATCGCCCT CAGTGGCCGG
GTATCGGCGC AGGAGGCCAG CGACGACGGC CTGAACATGA CCTTCTCGCC GATGGTCGAC
ATCACCCGCG ACCCGCGCTG GGGCCGGGTC TCGGAAGGCT TCGGCGAAGA TACCTGGCTG
GTATCCAAAA TTGCCAAAGT GATGGTGGAT GCCTATCAAA ACGGCGACCC GTCCAAGCCC
GGCTCAATCA TGGCCAGCGT GAAGCACTTC GCGCTGTACG GTGCCACCGA GGGTGGCCGC
GATTACAATA CCGTCGATAT GAGTCCACTG AGAATGTACC AGGACTACCT GCCGCCTTAT
AAGGCCGCGG TTGACGCCGG CAGCGGTGGG GTGATGGTCT CGCTCAATTC GATAAACGGC
ATTCCGGCTA CCGCTAACCC CTGGTTGCTG AAAGATTTGC TGCGCGACCA GTGGGGCTTC
AAGGGCATCA CCATCAGTGA CCACGGTGCG ATTAAAGAGT TGATCAAACA CGGCGTGGCC
GCAGATGCGC GCGATGCGGT GCGCCTGGCG ATTACCTCCG GCGTCGATAT GAGCATGAGC
GACGAATATT ACGACCAATA CCTACCGGGG CTGGTGAAGG ACGGGCTGGT ATCGGAAAGT
GACATCGACC GTGCCTGTCG CGACGTGCTG AATACCAAAT ATGATATGGG CCTGTTTAAA
GACCCTTACA CCCACTTGGG GCCGGTCGGT TCCGATCCGC AGGACACTAA CGCCGAAAGC
CGTTTGCATC GTGCCGAAGC GCGCGTAGTT GCGCGTAAAA CCATGGTGCT GTTGAAGAAT
GATAAGCAGA CGCTGCCACT CAGCAAGCAG GCGACCATTG CGCTGGTCGG GCCGATGGCC
GACAGCCAGC GTGATGTGAT GGGCAGTTGG TCGGCGGCCG GGGTGATTAA ACAGTCGGTC
ACCCTGCGTG AGGGTCTGGA ACGTGCGGTG GGCGACAAGG CTAAAATCCT CTACGCCAAG
GGCGCCAACG TCACTCAGGA CAAGGGCATT ATCAACTATC TGAATGAATA TGAGCCGGCG
GTGGCGTTTG ATACTCGTCC ACCGCAGCAG ATGATTGACG AAGCGGTGCA GGCGGCGAAG
AAAGCCGATG TCGTGGTGGC AGTGGTGGGG GAATCGCAGG GCATGGCTCA CGAGGCTTCC
AGCCGCGCCG ACATTACCAT TCCACAAAGC CAACGTGACC TAATCGCCGC GCTGAAAGCA
ACCGGCAAAC CGCTGGTGCT GGTGCTGATG AATGGCCGAC CGCTGGCGCT GAGCTGGGAA
AGTCAGCAGG CGGATGCGAT GCTGGAAACC TGGTACAGCG GCACCGAGGG CGGCAATGCG
GTGGCCGACG TACTGTTTGG CGACTACAAC CCGTCGGGCA AGCTGCCGAT GACCTTCCCG
CGTTCTGTCG GGCAGATCCC GATGTACTAC AACCACCTGA ATACCGGCCG TCCGTTCGGC
AAGGAAAACC CGGGTAAATA TACCTCCCGC TACTTCGACT CACCGAACGG CCCGCTGTAT
CCGTTTGGTT ATGGCCTGAG CTACACCAGC TTTAGCCTGT CGGATCTGAA ACTCTCCAGC
CCGACGATGG CGCGCAACGG CAAGATCACC GCCAGCGTCA CGCTCAAGAA CACCGGTAAA
TATGATGGTG CCACCGTGGT GCAGTTGTAT CTGCAGGACG TGACCGCCTC GGTCAGTCGC
CCGGTAAAAG AGCTGCGTAA CTTTAAAAAG GTGATGCTGA AAGCGGGGCA GGCGCAGAAG
GTGGAGTTGC CTATTACCGA AGAGGATCTC AAGTTCTATA ACGCCAGCCT GAAATGGGGC
GCGGAACCGG GCAAGTTTAA TGTGTTTGTC GGCCTGGATT CCGACGACGT ACAGGCGCAG
AGCTTTACGC TGAAGTAA
 
Protein sequence
MKWLCVVSML SGLAISPVFA QESAVIQGVN AQQRDAFVSN LMKQMTLEEK IGQLRLISVG 
PDNPKEAIRD GISKGQIGAI FNTVTRPDIR AMQDQAMQLS RLKIPLFFAY DVVHGQRTVF
PISLGLAASF DLEAIALSGR VSAQEASDDG LNMTFSPMVD ITRDPRWGRV SEGFGEDTWL
VSKIAKVMVD AYQNGDPSKP GSIMASVKHF ALYGATEGGR DYNTVDMSPL RMYQDYLPPY
KAAVDAGSGG VMVSLNSING IPATANPWLL KDLLRDQWGF KGITISDHGA IKELIKHGVA
ADARDAVRLA ITSGVDMSMS DEYYDQYLPG LVKDGLVSES DIDRACRDVL NTKYDMGLFK
DPYTHLGPVG SDPQDTNAES RLHRAEARVV ARKTMVLLKN DKQTLPLSKQ ATIALVGPMA
DSQRDVMGSW SAAGVIKQSV TLREGLERAV GDKAKILYAK GANVTQDKGI INYLNEYEPA
VAFDTRPPQQ MIDEAVQAAK KADVVVAVVG ESQGMAHEAS SRADITIPQS QRDLIAALKA
TGKPLVLVLM NGRPLALSWE SQQADAMLET WYSGTEGGNA VADVLFGDYN PSGKLPMTFP
RSVGQIPMYY NHLNTGRPFG KENPGKYTSR YFDSPNGPLY PFGYGLSYTS FSLSDLKLSS
PTMARNGKIT ASVTLKNTGK YDGATVVQLY LQDVTASVSR PVKELRNFKK VMLKAGQAQK
VELPITEEDL KFYNASLKWG AEPGKFNVFV GLDSDDVQAQ SFTLK