Gene Spro_2521 details

Gene Information

Locus tag: Spro_2521
Symbol:
ID: 5603871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 2768743
End bp: 2770413
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 53%
IMG OID: 640938060
Product: alpha amylase catalytic region
Protein accession: YP_001478750
Protein GI: 157370761
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
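
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2768743
end_bp = 2770413
gene_length_bp = 1671
protein_length_aa = 556


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1671
print(expected_protein_length(gene_length_bp))  # 556
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed in the record.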


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGTG GGCAAACCGT GCGTTGGTGG AAACAGGCGG TGGTTTATCA GATCTATCCG 
CGCAGCTTTA TGGATTCTAA CGGCGATGGC ATTGGCGATC TGAATGGCAT CACCGACAAG
CTGGACTATT TGCAGTGGCT GGGCATTGAC GTGATTTGGA TCTGCCCGAT GTACCGTTCG
CCGAACGACG ACAATGGTTA CGATATCAGT GATTATCAGG CCATTATGAC GGAGTTCGGC
ACCATGGCGG ACTTTGACCG TTTGTTGGCC GAAGTGCATG CCCGCGGCAT GCGTTTGATA
CTGGATTTGG TGGTCAACCA CACTTCGGAT GAGCATCCGT GGTTTATTGA ATCCCGGTCA
TCTCAATCGA GTGCCAAACG CGACTGGTAT ATCTGGCGAG ACGGCAAAAA CGGCGCGGAA
CCCAACAACT GGGAAAGCAT CTTCAACGGT TCCGCCTGGA AATATGACCC CGGCAGCGAA
CAATACTTTC TGCACCTGTT CTCTGAACGT CAACCGGATC TGAACTGGGA GAATCCGCAG
GTGCGTACGG CGCTGTATGA CATGATGCGC TGGTGGCTGG ACAAGGGCAT TGACGGCTTC
CGCATCGACG CTATCTGCCA TATGAAAAAA CAGCCGGGGC TGACCGATAT GCCCAACCCG
CAGGGACTGC GTTATGTGCC GTCGTTCGAA CGGCATCTTA ACTACGACGG CCTGCTGGAC
TATGTCGATG ACCTGTGCGA GCAGGTATTT AACCGCTACG ACATCATGAC GGTGGGCGAA
ATGAACGGGG CCTCGGCACA GCAGGGCGAG GACTGGGTGG GGGAACAGCA CGGCCGCCTG
AATATGATCT TCCAGTTTGA ACACGTCAAA CTGTGGGAAA CCAGCCATCA GCAGTCACCG
GACGCTGGGC TCGATGTGAT GGGTCTGAAA GAGATCTTTA CCCGCTGGCA GACATTGCTG
GAAGGGAAAG GCTGGAACGC GTTGTACGTC GAAAATCATG ATATTCCGCG CGTGGTCTCT
AAATGGGGTG ACGATAAGCA GTATTGGCGT GAAAGTGCTA CCGCCATCGC CGCCATGTAT
TTCCTGATGC AGGGAACACC GTTTATTTAT CAGGGCCAGG AACTGGGGAT GACCAATACC
CGCTTCACGA GTCTGGCCGA TTTTAACGAT ATTGCCGCCA GAAACCGCTT TGCCAAACTG
CAGGTGCAGG GCATGGATGA AGCACAGATC CTGGCATTTC TCGGTCGTAG TGGGCGCGAT
AATTCACGAA CCCCAATGCA GTGGGACGAC GGGCCTCATG CTGGCTTCAG TACCGTTACA
CCGTGGTTCT CGTTGAATGC CAACTTTGAA CAGATTAACG TGGCGCGTCA GCGTAACGAG
CCGGATTCGG TGCTGAGTTT TTACCGGGCC TTGATCCGCC TACGCAAGCA GGATCCGATG
TGGGTCTATG GCCGCTACCA ATTACAACTG GCGGCGCATC CCCATATTTA CGCCTACAGC
CGTAGCCTGG ATCAACGGCA AGGGTGGGTA TTGTGCAACC TGAGTGGTGA AACACAACAA
ATTGACTCTC AACTCTTGCC ATTGGGAAAA AGATCATTGT TGTTGAGTAA TTATTCCCAC
CAGGGTGAAC GACAAATACT TCGTCCTTAT GAGGCACGAA TCTATCAATA G
 
Protein sequence
MSSGQTVRWW KQAVVYQIYP RSFMDSNGDG IGDLNGITDK LDYLQWLGID VIWICPMYRS 
PNDDNGYDIS DYQAIMTEFG TMADFDRLLA EVHARGMRLI LDLVVNHTSD EHPWFIESRS
SQSSAKRDWY IWRDGKNGAE PNNWESIFNG SAWKYDPGSE QYFLHLFSER QPDLNWENPQ
VRTALYDMMR WWLDKGIDGF RIDAICHMKK QPGLTDMPNP QGLRYVPSFE RHLNYDGLLD
YVDDLCEQVF NRYDIMTVGE MNGASAQQGE DWVGEQHGRL NMIFQFEHVK LWETSHQQSP
DAGLDVMGLK EIFTRWQTLL EGKGWNALYV ENHDIPRVVS KWGDDKQYWR ESATAIAAMY
FLMQGTPFIY QGQELGMTNT RFTSLADFND IAARNRFAKL QVQGMDEAQI LAFLGRSGRD
NSRTPMQWDD GPHAGFSTVT PWFSLNANFE QINVARQRNE PDSVLSFYRA LIRLRKQDPM
WVYGRYQLQL AAHPHIYAYS RSLDQRQGWV LCNLSGETQQ IDSQLLPLGK RSLLLSNYSH
QGERQILRPY EARIYQ
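
The relationship between the gene sequence, the protein sequence and the GC content field can be explored with a short script. This is a minimal sketch, not the tool that produced the record: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns amino acids to codons the same way as the standard code (the tables differ in which start codons are permitted, which is not modelled here).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
first_bases = "ATGAGCAGTG GGCAAACCGT GCGTTGGTGG"
print(translate(first_bases))  # MSSGQTVRWW
```

The ten residues printed match the start of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the listed protein sequence and the GC content field.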