Gene Spro_2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_2007 
Symbol 
ID5603458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp2197614 
End bp2198999 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content55% 
IMG OID640937545 
Productglycoside hydrolase family protein 
Protein accessionYP_001478238 
Protein GI157370249 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0536804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTAA CCGTTCTGGG CGGCGGCGGC GTGCGTTCGC CGTTCCTGGC CAAATCTATC 
GCCTACAACG CCCACCGTAT CGGCGTCACT GAAGTGGTGT TTATGGACAC CGATCAACAT
AAACTGGCCA TCTACGGTGC CATCGCTCAG GGGGTATTCC AGCGTATTCG CAGCGATATC
GCCTTCAGCC TGACCAGCGA TGCGCATCAG GCATTGAGCG GTGCAGACTA TATCATCACC
ACCCTGCGTA TCGGCGGCGA AGAGGGGCGT ATTGATGATG AACGCATCGC GCTCAACCAT
CAGGTTCTGG GGCAGGAAAC CACCGGTGCC GGCGGCTTCG CCATGGCAAT GCGTTCAATC
CCGGCGATTA TCGATTACTG CCGACTGATC GAACAGCTGT CGTCACCCGA TGCAGTGCTA
TTCAACTTCA CTAACCCTTC CGGTATGGTG ACCGAAGCTA TCATCAAGTC AGGCTTTAAA
CGCCAGGTGT ACGGTATCTG CGATGCGCCC AGCGAGTTTA TCCGCGAACT GGCCGAGTTG
TTGGGTTGCC GCGAGAGTGA ATTGAGTATC GACTGCTTTG GCCTGAACCA CCTGTCCTGG
TTCCGCAATG CCAGGGTCAA CGGTGAACCG GTAACCGAAC GGCTGCTGGC GGACCCGCGC
CTGTACCGCG ACACCTGCAT GAAATACTTC TCACCGGAGC TGGTCGAACT CTCCGATAAC
CTGATGCTCA ACGAGTATCT GTATTACTAC TACTATCGCG AGCAAGCGAT CGCCGCTATC
GTCAGCGCCG GAGAAACCCG CGGCGAGCAA ATTGCGCAGA TCAATCGGCA GATGCTGGCA
GACCTGGCCG AGCTGGACAT CCCGAACCAG CTGGATCAGG CCTTCAGCCT CTACTTCAGC
CATTATCTGA CGCGCGAAAA CTCGTATATG CAGCGCGAGT CCAACCAGGG CAAGGTGAAA
GAGCGCACCA TGCTGACGCT GCAACAGTTT ATCGAACAGC CGGACAGCGG TGGCTACGCC
GGGGTGGCGA TCGATATTCT GGAGGCGGTG AACAGCGGCC AACAAAAACG CGTGGTGGTG
TCGATGCAAA ACAACGACAC GCTGGACTTT CTGCATCCTG AGGACGTGAT CGAAATCAGC
TGTGAACTAA GCAGTGCGGG CATTCACCCG GTGAAAATGC GCGATATTCC CGATACGCAA
AAAAACCTGA TCGCTCGGGT GAAAGAGTAC GAACGGTTGG CAGTAGAAGC GATTCTTGAA
GGTAACCGTA AAAAAGCCAT CAAAGCCTTG ATGGTGCACC CGCTAGTGAA TTCTTACTCG
CTAGCGAAAA CGCTGGTGGA GGAGTATCTG CAGGCCCATC GGCAATATGC CGAACACTGG
CGTTAA
 
Protein sequence
MKLTVLGGGG VRSPFLAKSI AYNAHRIGVT EVVFMDTDQH KLAIYGAIAQ GVFQRIRSDI 
AFSLTSDAHQ ALSGADYIIT TLRIGGEEGR IDDERIALNH QVLGQETTGA GGFAMAMRSI
PAIIDYCRLI EQLSSPDAVL FNFTNPSGMV TEAIIKSGFK RQVYGICDAP SEFIRELAEL
LGCRESELSI DCFGLNHLSW FRNARVNGEP VTERLLADPR LYRDTCMKYF SPELVELSDN
LMLNEYLYYY YYREQAIAAI VSAGETRGEQ IAQINRQMLA DLAELDIPNQ LDQAFSLYFS
HYLTRENSYM QRESNQGKVK ERTMLTLQQF IEQPDSGGYA GVAIDILEAV NSGQQKRVVV
SMQNNDTLDF LHPEDVIEIS CELSSAGIHP VKMRDIPDTQ KNLIARVKEY ERLAVEAILE
GNRKKAIKAL MVHPLVNSYS LAKTLVEEYL QAHRQYAEHW R