Gene Spro_4223 details

Gene Information

Locus tag: Spro_4223
Symbol:
ID: 5603177
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 4682168
End bp: 4683487
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 56%
IMG OID: 640939783
Product: PTS system lactose/cellobiose family IIC subunit
Protein accession: YP_001480445
Protein GI: 157372456
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1455] Phosphotransferase system cellobiose-specific component IIC
TIGRFAM ID: [TIGR00359] phosphotransferase system, cellobiose specific, IIC component
            [TIGR00410] PTS system, lactose/cellobiose family IIC component
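
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes one untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4682168
end_bp = 4683487
gene_length_bp = 1320
protein_length_aa = 439

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Both comparisons agree with the listed values, which is consistent with the inclusive-coordinate assumption.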


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.298661
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTATCA GCCAGGCCGC ATTCAATTTT ATTGAAAACC GTATCAGCCC CATTGCCGGT 
AAACTATCGA CCCAGCGACA TATTATGGCG ATCCGCGACG GTTTTATTTC CGCGATGCCC
TTTATGATCG TCGGCTCATT CTTATTGGTA TTTGCCTACC CGCCTTTTTC GGCCGACAGC
AGTTGGGGAA TAGCTCAATG GTGGCTGGGT GCTGCGGAGA AACATCAGGT CGCTATTCTC
ACCCCGTTTA ATATGACCAT GGGGATTATG TCGATTTATA TTACTGCCGC CATCGCTTAT
AACCTGGCGC AGAGTTATAA GCTCGATCCC TTTATGGCGG CCATGCTGGC GCTGATGTCA
TTTTTGCTGG TCGCCGCGCC GCAGACCGAA AAGATGCTGC CGACCGCTGC GTTGGGCGGC
GTGGGGATCT TCACCGCCAT TTTGGTGGCG GTCTATACCA CCGAACTGAT CCGCTTTCTC
AAGCAGCACA ATATCGGTAT CTCACTGCCG GAACAGGTGC CGGCCAAGAT CAAACAGTCG
TTTGATCTGC TGATCCCGAT CCTGGCGGTG GTGATCACCC TGTATCCGCT TAGCCTGCTG
GTGCAGCATC AGTTCAACTT ACTGTTGCCA CAGGCGATCA TGGCGCTGTT CCAGCCGCTG
ATTTCCGCTG CGGATTCGCT GCCGGCGATC CTGCTCGCGG TGCTGATCGG CCATTTATTG
TGGTTCGCCG GTATTCACGG CGCGGTGATC GTCTCCGGCA TGCTGCAGGC ATTCTGGCTG
ACCAATCTGG GCATCAATCA GGATGCGTTG GCCGCCGGCC ACCCGATGCC GCACATCTTT
ATGGAAGCTT TCTGGACCTT CTTTATCGTC ATCGGCGGAT CCGGCGCCAC CTTTGGGCTG
GTGCTGCTTT ATCTGCGCAG CCGTTCGGCG CACCTGCGTT CGATCGGCAA GCTGAGCCTG
GTGCCGAGCT GCTTCAACAT CAATGAACCG GTGATCTTCG GCACGCCGAT CGTCATGAAC
CCGACCTTCT TTATCCCGTT TATTACCGCG CCGATCGTCA ATTCGATCAT TGCCTATGCG
GCCGTGAAGC TGGATCTGAT TGGCCGCGTG ATCTCGGTAG TGCCCTGGAC GGCACCGGCA
CCGATTGGCG CTGCCTGGGC CACCGGTTGG GATCTCCGTG CCGCACTGCT GGTGCTGCTG
CTGGCGGCGG TATCGGCACT GATTTACTAC CCGTTCTTCA AGGTCTATGA GCAGCAGTTG
CTGGATCAGG AAGTGAGTGA GGCGGAGCAG ATCGAACAGA CACAAGGAGT CACCGAATGA
 
Protein sequence
MSISQAAFNF IENRISPIAG KLSTQRHIMA IRDGFISAMP FMIVGSFLLV FAYPPFSADS 
SWGIAQWWLG AAEKHQVAIL TPFNMTMGIM SIYITAAIAY NLAQSYKLDP FMAAMLALMS
FLLVAAPQTE KMLPTAALGG VGIFTAILVA VYTTELIRFL KQHNIGISLP EQVPAKIKQS
FDLLIPILAV VITLYPLSLL VQHQFNLLLP QAIMALFQPL ISAADSLPAI LLAVLIGHLL
WFAGIHGAVI VSGMLQAFWL TNLGINQDAL AAGHPMPHIF MEAFWTFFIV IGGSGATFGL
VLLYLRSRSA HLRSIGKLSL VPSCFNINEP VIFGTPIVMN PTFFIPFITA PIVNSIIAYA
AVKLDLIGRV ISVVPWTAPA PIGAAWATGW DLRAALLVLL LAAVSALIYY PFFKVYEQQL
LDQEVSEAEQ IEQTQGVTE
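
The sequences above are printed in blocks of ten characters, so the spaces and line breaks need to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-percentage helper and a start/stop codon check; the function names `clean_sequence` and `gc_percent` are hypothetical, and the stop codon set assumes the standard bacterial code (translation table 11).

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last rows of the gene sequence, copied from the listing above.
first_row = "ATGAGTATCA GCCAGGCCGC ATTCAATTTT ATTGAAAACC GTATCAGCCC CATTGCCGGT"
last_row = "CTGGATCAGG AAGTGAGTGA GGCGGAGCAG ATCGAACAGA CACAAGGAGT CACCGAATGA"

# Assumption: translation table 11 uses ATG as the usual start and TAA/TAG/TGA as stops.
starts_with_atg = clean_sequence(first_row).startswith("ATG")
ends_with_stop = clean_sequence(last_row)[-3:] in {"TAA", "TAG", "TGA"}

print("starts with ATG:", starts_with_atg)
print("ends with a stop codon:", ends_with_stop)
```

Passing all rows of the gene sequence to `gc_percent` gives a value that can be compared with the GC content field in the record.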