Gene Spro_0145 details

Gene Information

Locus tag: Spro_0145
Symbol:
ID: 5602957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 156247
End bp: 157815
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 59%
IMG OID: 640935632
Product: hypothetical protein
Protein accession: YP_001476383
Protein GI: 157368394
COG category:
COG ID:
TIGRFAM ID: [TIGR03369] cellulose biosynthesis protein BcsE
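
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not state this, but the listed gene length matches that convention) and that the gene length includes the stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 156247
end_bp = 157815

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```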


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCAAT CTTTCTCATT AGGTATTCGG CAGGTTTGGG AAGAACTGTC TGTCATGCAG 
GCCCCGGGTC TGTATTGGGT GAATATCGAC CGTCAAACCG ATGCTAACCT GCTTTGCCAA
CAAACCCTGG CCGCCCAGGC CGCAACCAGC CAGGTGGCAT TGATTTGCAG TGGGGATAAA
CCTGACCGAC TGCTGGCCGA ACTTGCCTCA CCGGCGCTGA AAAAAATTCC GCTGTACCAG
CTGCCGGAAA AAAAAGCCGC GCTGACGCAT TTCAGCGACG ACCTGATGCG CGCCTTAAAA
CCCCAAAACC GCTTATTAAT CCTGTTGGCC CATGCCAGTT TGTGGCAAAC CTTCACCAGT
GAAGAACTGC GCGACTGGAC CCGCACTACC GGCGCCTGGC TACGCCGACA AGGCTGCACG
CTGCTGATCC TCAGCCACGG CGGCGGCGTC AACAAACTCA AGGGGCAACT CAGCGCCCAG
CACCGGATCC TTAACGGCCT GTCCAGCCTG CAGTGGCAAC AGGACAGCGC ACAGTATTTG
GTTAACTGGT GGAGTACCGA AAACGGCATC AATGCCAACC AACTGCTGAC GCTGTATGCC
GGTGAAAACG GCTGGCAGGG TGACGATGAC AACAACAAAC CCTCGCCAAC CTCCATTCGC
AGCGATGAAG GACTTTATCT GGCCCAACAG AGCATTCTGG AAGGTGCGCC GCCGCTGTCG
GCCAACTGGC AACTGCTGGA AAGTAATGCG CTACTGGCGC AGCACGGCAT GCTGACACTC
TCGGCAACGT TGATTTTTGC GCTGTACCAA AGCGATGAAA TCGATGCACT GGCACACCAA
ATCCACAGCC TGCGCCGCCA ACGCGGCAAC GGGCTGAAAA TCGTGGTGCG CGAGATGAGC
GCCAGCCTGC GTTACAGCGA CGAACGCCTG CTGCTGGCCT GCGGCGCCAA CCTGATTGTG
CCGCACGTGG CACCGCTGTC GCGCTTTCTG ACCATGCTTG AAGGCATCCA GGGGCAGCGC
TTCTCACGCC ACGTTCCGGC AGATATCGAC GCCCTGCTGG CCGGGTTGCG GCCGTTGCAG
TTGAAGGGCT ACATGCGGCC GGATGATTTT AGCCAGGCAG TCATTTCGCT GATGGGCAAC
ACGCTGTTGC CGGAGGACGG TAAAGGCGTG ATGGTCGCAC TGCGCCCGGC GCCGGGGCTG
CGTGCCGAAC AGGCGATGAC CCTGTGTTTT TTGCGCCGCT TCGGCGACGT GATGACCGTG
GTGCAAGGCC GGCTGGTGCT GTTCCTCTCC ACCTGCCGAA TTAACGATTT GGATACCGCC
CTGAAATTTA TCTTCCGCCT GCCGGTAGAC GAAGCCTTCA GCAACCGGGT GGTCTGGCAT
CAGGACGTAG ATATTATTTC AGAAATCAAA CGGCTGGCGC ACAACGCCCC GGTTCCGTTG
GGCGCGGCGA CCGCAGCGGT GGCACCACGT CAAACCGCCG CCGAGGCAGC AAGCCCGCAG
GAACGCCGTC AACCGGTGGC CTTCACGCTA ACCACCACGG CACAGGAGAA CAAGCATGTT
GAACCTTAG
 
Protein sequence
MAQSFSLGIR QVWEELSVMQ APGLYWVNID RQTDANLLCQ QTLAAQAATS QVALICSGDK 
PDRLLAELAS PALKKIPLYQ LPEKKAALTH FSDDLMRALK PQNRLLILLA HASLWQTFTS
EELRDWTRTT GAWLRRQGCT LLILSHGGGV NKLKGQLSAQ HRILNGLSSL QWQQDSAQYL
VNWWSTENGI NANQLLTLYA GENGWQGDDD NNKPSPTSIR SDEGLYLAQQ SILEGAPPLS
ANWQLLESNA LLAQHGMLTL SATLIFALYQ SDEIDALAHQ IHSLRRQRGN GLKIVVREMS
ASLRYSDERL LLACGANLIV PHVAPLSRFL TMLEGIQGQR FSRHVPADID ALLAGLRPLQ
LKGYMRPDDF SQAVISLMGN TLLPEDGKGV MVALRPAPGL RAEQAMTLCF LRRFGDVMTV
VQGRLVLFLS TCRINDLDTA LKFIFRLPVD EAFSNRVVWH QDVDIISEIK RLAHNAPVPL
GAATAAVAPR QTAAEAASPQ ERRQPVAFTL TTTAQENKHV EP
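
The protein sequence can be related back to the gene sequence by translating codons with the genetic code named in the record (translation table 11). The following is a minimal sketch, not the database's own method: it builds the codon table from the NCBI-style amino acid string, whose assignments are the same for table 11 as for the standard table, and translates only the first 30 and last 9 bases of the gene sequence. The helper name `translate` is hypothetical, and alternative start codons (which table 11 permits) are not handled because this gene starts with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: bases in the order T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino = CODON_TABLE[seq[i:i + 3]]
        if amino == "*":  # stop codon: end of the coding region
            break
        protein.append(amino)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGGCGCAAT CTTTCTCATT AGGTATTCGG"))
# Final 9 bases of the gene sequence, which are in frame (1560 bases precede them).
print(translate("GAACCTTAG"))
```

The two results correspond to the first ten and last two residues of the protein sequence listed above.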