Gene Spro_2053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_2053 
Symbol 
ID5607094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp2246509 
End bp2247546 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content56% 
IMG OID640937591 
ProductLacI family transcription regulator 
Protein accessionYP_001478284 
Protein GI157370295 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGCA AGAAAACCAC CCTGGCCGCC ATTGCCAGAG AGGCACATGT TGGTATCGCT 
ACGGTTGATC GGGTAATAAA CCAGCGCGCA ACAGTACGGC CGGAAACGGC ACGCAGGGTC
ATTGCGGCGG CGCACAAATT AGGCTTCGCA CTAGAGAAAT CGCATCAACT GTTTGAGACG
ATGGGCCAAC CAGCCGCACG GATTAAAATG GGCTTTATCC TGCTGCGCAA GGAACATTCG
TTCTACGCAC AATTGGCGGA TAGCCTGCTG GAACAGGCCG CACCCTATTA CGACGCTGAG
CACCCGCCAC AGTTCATGTT TCATGATATC AGTGCGGTCA GCGACACCGC TGCCGCCATC
ACACAGTTAA GCCAAAATGT GGATGTGATC GGCGTGCTGG CGCTGGATAA CCCGATGATC
CGTTTTGCGG TGGAAGAGGC TAGCAGGCAG GGGGTAAAGG TATTCACGTT GCTGTCTGAT
TTATCGGTAC ACAGCCGTGC CGGTTATATC GGCTGGGATA ACCAGCAGGC AGGTCGTACC
GCCGGCTGGG CGGTGGAGCG CTTGTGCCAT CGGCAGGGCG ATGTCGGGGT CATTATCGGC
GATAACCGTT TTCTGTGTCA GGAAACCTGC GAAATCAGCT TTCGATCTTA CCTGCGCGAA
CACCTGAGTG GTCTGCGGGT GCTTGAGCCG GTACGCAGTC ATGAACGGAC CGAAAGCGCA
AGACAAGTCA CTCAAACCTT GCTGGAGCAG CATCCCAATC TGGTGGCTCT GTATGCACCC
TGCGGGGGTG TGGAAGGGAT TATTGCTGCG CTGCGGGAAA GCGGCAGGCA GCATCAGGTG
ATGCTGATTT GCCATGGCCC GGTTACAGGT GGCGAAATGG CGCTGATCGA CGGCACGCTG
GATCTGATGC TCAGACATCG TATTGCCGAG TTTGCGGCGT CAGTCATCAG CACTTTTGTC
GCCGCCACCG TTGGCGGCTC CTCCGGTTTT AGTCACACCA TTAACCGCTT TGATCTGATC
ACCAAAGAAA ACCTCTGA
 
Protein sequence
MAGKKTTLAA IAREAHVGIA TVDRVINQRA TVRPETARRV IAAAHKLGFA LEKSHQLFET 
MGQPAARIKM GFILLRKEHS FYAQLADSLL EQAAPYYDAE HPPQFMFHDI SAVSDTAAAI
TQLSQNVDVI GVLALDNPMI RFAVEEASRQ GVKVFTLLSD LSVHSRAGYI GWDNQQAGRT
AGWAVERLCH RQGDVGVIIG DNRFLCQETC EISFRSYLRE HLSGLRVLEP VRSHERTESA
RQVTQTLLEQ HPNLVALYAP CGGVEGIIAA LRESGRQHQV MLICHGPVTG GEMALIDGTL
DLMLRHRIAE FAASVISTFV AATVGGSSGF SHTINRFDLI TKENL