Gene Spro_3931 details

Gene Information

Locus tag: Spro_3931
Symbol:
ID: 5603976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 4355351
End bp: 4356451
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 59%
IMG OID: 640939491
Product: hypothetical protein
Protein accession: YP_001480154
Protein GI: 157372165
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
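
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 4355351
end_bp = 4356451

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # prints: 1101 366
```

Both values agree with the Gene Length and Protein Length fields listed in the record.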


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACATC TTTGGATATT GCCCACCACC CTGCTGCTGG CCGGCGCCTC ATCATCCGCC 
TGGGCCGATG CCTATCTCGA CCAGGCCACG GCGGCGGTCA CCAAGGCCAC CGCCAGCGTG
ACCCAGTGGG ATGGCCCCAC CAGCGGGCCA AAGTTACAGG CCAACAAGAA AATCATCTTT
ATCGCTTCGG ACATGAAAAA CGGCGGCGTG CAGGGCGTGC AGGAAGGGCT GAGTGAAGCG
GCGAAAGCGG CCGGCTGGAA ATTGGAAACC CTGGATGGCG GCGGCTCGGT GAAAGATCAA
TTGGCGTCAC TGAACCAGGC GATTGCCCAG AAGCCGGACG GTATCGTGAT CGGCGGCTGG
AACCCGAACG TGGCCAAGAT CCCGCTGAAA AAAGCCATCC AGCAAGGCAT TGTGCTGACC
GCCTGGCATG CGGTGCCTGA GCCAGGGCCA ATCGCCAAAT ACAACGTGTT TTACAATGTC
ACCTCCGACT CCAATGACGT GGCACGCATC GCCGCCCAGT ATGCGGTGGT GCAGTCCGGC
GGCAAGGCCA ACGTGTTGAT CTTTACCGAT TCGCTGTACC AAATCGCGCT GGACAAGGCC
AACGTGATGA AAGAGGAAAT CGGCAAATGC AGTGGCTGCA AGGTGGTCGA GTTTATCGAC
ACCCCGCTGG CGGATACCGC CAACCGCATG CCGGCCATGA CTTTCAGCCT GCTGCAAAAA
TACGGTGACC AGTTCCAGTA CGCGCTGGCC ATTAACGATC TGTATTTTGA TTTTATGGCC
CCGGCGCTGA AAACCGCGGG CAAAGGCGGC AACAATGCGC CCTACAGCAT CTCGGCCGGC
GACGGCTCTA TTTCCGCCTA TCAACGCATC CGCTCTGGGG GCAGCCAATC GGCTACGGTA
CCGGAACCGC TTAAACTGCA CGGCTGGCAA CTGCTGGATG AGTTTAACCG CGCCTTCGCC
AAACAGCCGC CTTCCGGCTA CATCACCCCG GCGCACCTGG TGACCCGCGA CAATATCACT
GCCGACGGTG GCAGCAGCAA CCGGTATGAC CCGCAAAATG ATTATCAGGG CCACTACAAA
GCCATTTGGG GCGTGAAGTA A
 
Protein sequence
MKHLWILPTT LLLAGASSSA WADAYLDQAT AAVTKATASV TQWDGPTSGP KLQANKKIIF 
IASDMKNGGV QGVQEGLSEA AKAAGWKLET LDGGGSVKDQ LASLNQAIAQ KPDGIVIGGW
NPNVAKIPLK KAIQQGIVLT AWHAVPEPGP IAKYNVFYNV TSDSNDVARI AAQYAVVQSG
GKANVLIFTD SLYQIALDKA NVMKEEIGKC SGCKVVEFID TPLADTANRM PAMTFSLLQK
YGDQFQYALA INDLYFDFMA PALKTAGKGG NNAPYSISAG DGSISAYQRI RSGGSQSATV
PEPLKLHGWQ LLDEFNRAFA KQPPSGYITP AHLVTRDNIT ADGGSSNRYD PQNDYQGHYK
AIWGVK
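
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the last line of the gene sequence rather than the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# Last line of the gene sequence above, used here only as a small example.
tail = clean_sequence("GCCATTTGGG GCGTGAAGTA A")
print(len(tail), tail[-3:])  # prints: 21 TAA
print(round(gc_fraction(tail), 3))
```

The final three bases, TAA, are a stop codon under translation table 11, which is consistent with the protein sequence ending one codon before the end of the gene.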