Gene SAG0330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0330 
SymbolcelB 
ID1013119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp338026 
End bp339327 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content37% 
IMG OID637315522 
ProductPTS system, cellobiose-specific IIC component 
Protein accessionNP_687364 
Protein GI22536513 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1455] Phosphotransferase system cellobiose-specific component IIC 
TIGRFAM ID[TIGR00359] phosphotransferase system, cellobiose specific, IIC component
[TIGR00410] PTS system, lactose/cellobiose family IIC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.240964 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAAGT TTGATAGTCA GAAAATAATT ACTCCGATTA TGAAGTTTGT CAATATGCGA 
GGGATTATTG CACTCAAAGA TGGCATGCTA GCAATTCTAC CACTAACAGT TGTTGGGAGT
CTCTTTTTAA TATTAGGGCA GCTTCCATTT AAAGGTCTTA ATCAAGCCAT AGCTAATGTT
TTTGGACCAG AATGGACAGA ACCATTTATG CAAGTTTATT CAGGAACTTT TGCGATTATG
GGCTTGATTT CCTGTTTTGC AATTGCTTAT GCTTATGCTA AAAATAGTAG TGTAGAACCG
TTACCTGCTG GCGTTTTATC ACTTTCGTCT TTCTTTATTT TAATGAAATC ATCTTATATT
CCTGTAAAAG GAGAAGCAAT AGCGGATGCT ATTTCTAAGG TTTGGTTTGG CGGTCAGGGG
ATTATTGGAG CAATTATTAT TGGCTTAGTA GTTGGTGCTA TCTATACATG GTTTATCCAA
CATCATATTG TTATAAAAAT GCCAGAGCAA GTACCACAGG CAATAGCAAA ACAATTTGAA
GCTATGATTC CAGCTTTTGT TATCTTTCTT TTATCGATGA TTGTTTATTT GATTGCTAAG
GTAACAACTG GAGGTACCTT TATTGAGATG ATTTATGATA TCATTCAAGT ACCGTTGCAA
GGCTTAACAG GCTCACTTTA TGGAGCAATT GGGATTGCTT TCTTTATTTC ATTCTTATGG
TGGTTTGGTG TCCATGGCCA ATCTGTGGTG AATGGTATTG TGACTGCTTT GTTACTATCC
AATTTAGATG CCAATAAGTC TTTATTAGCA GCAAATCGCT TAACATTAGA TAATGGCGCT
CACATTGTAA CACAGCAGTT TTTGGATAGT TTCTTGATTT TATCGGGCTC AGGAATAACT
TTTGGACTAG TTATTGCGAT GCTTTTTGCA GCAAAATCAA AACAATACAA GGCACTTGGG
AAAGTGGCCG CTTTTCCCGC AATTTTCAAT GTTAACGAGC CAATCGTCTT TGGCTTTCCA
ATCGTAATGA ATCCTGTGAT GTTTCTGCCT TTTATTTTAG TACCAGTTTT GGCAGCTTTA
ATTGTTTATG GAGCTATTGC GGTTGGTTTT ATGCAACCAT TCTCAGGTGT TACTTTACCA
TGGAGTACTC CAGCAATTAT TTCTGGATTC ATGGTAGGTG GTTGGCAAGG TGCACTCGTC
CAAATAGTCA TTTTAGCTAT CTCAACTGCT GTTTACTTCC CATTCTTTAA AATCCAAGAT
AATATTACTT ACAAAAATGA ATGTGAAATG GAAAGGGGAT AG
 
Protein sequence
MSKFDSQKII TPIMKFVNMR GIIALKDGML AILPLTVVGS LFLILGQLPF KGLNQAIANV 
FGPEWTEPFM QVYSGTFAIM GLISCFAIAY AYAKNSSVEP LPAGVLSLSS FFILMKSSYI
PVKGEAIADA ISKVWFGGQG IIGAIIIGLV VGAIYTWFIQ HHIVIKMPEQ VPQAIAKQFE
AMIPAFVIFL LSMIVYLIAK VTTGGTFIEM IYDIIQVPLQ GLTGSLYGAI GIAFFISFLW
WFGVHGQSVV NGIVTALLLS NLDANKSLLA ANRLTLDNGA HIVTQQFLDS FLILSGSGIT
FGLVIAMLFA AKSKQYKALG KVAAFPAIFN VNEPIVFGFP IVMNPVMFLP FILVPVLAAL
IVYGAIAVGF MQPFSGVTLP WSTPAIISGF MVGGWQGALV QIVILAISTA VYFPFFKIQD
NITYKNECEM ERG