Gene SeSA_A1410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A1410 
SymbolchbC 
ID6519538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp1358882 
End bp1360240 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content51% 
IMG OID642746528 
ProductPTS system N,N'-diacetylchitobiose-specific transporter subunit IIC 
Protein accessionYP_002114333 
Protein GI194736376 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1455] Phosphotransferase system cellobiose-specific component IIC 
TIGRFAM ID[TIGR00359] phosphotransferase system, cellobiose specific, IIC component
[TIGR00410] PTS system, lactose/cellobiose family IIC component 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.470901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0748148 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAACG TCATTGCTTC ACTTGAAAAG GTACTCCTTC CTTTTGCTGT AAAAATAGGA 
AAGCAGCCAC ACGTTAATGC AATCAAGAAC GGCTTTATTC GTTTAATGCC ATTAACCCTT
GCGGGGGCGA TGTTTGTATT AATTAATAAC GTTTTCCTGA GCTTTGGGGA GGGGTCGTTT
TTTTATTCCA TGGGCATTCG GCTTGATGCC TCAACCATTG AAACACTTAA TGGATTGAAA
GGCATCGGCG GGAACGTCTA TAACGGCACA TTGGGCATTA TGTCGCTCAT GGCGCCATTT
TTTATCGGCA TGGCGTTGGC GGAAGAGCGC AAGGTTGATG CGCTGGCCGC CGGACTGCTC
TCCGTGGCGG CATTTATGAC CGTAACGCCG TATAGTGTCG GCGAGGCCTA TGCCGTAGGC
GCAAACTGGC TGGGCGGCGC GAATATTATC TCCGGTATTA TTATTGGCCT GGTGGTCGCG
GAGATGTTTA CATTTATTGT CCGGCGCAAC TGGGTGATTA AACTACCGGA CAGCGTACCG
GCTTCGGTGT CTCGTTCATT CTCAGCATTA ATTCCCGGCT TTATTATTCT CTCTATTATG
GGGATTATTG CCTGGGCGCT GTCTAATTAC GGTTCTAACT TCCATCAGAT TATTATGGAC
ACTATCTCTA CGCCGCTGGC ATCGCTGGGT AGCGTGGTAG GGTGGGCATA TGTCATTTTT
GTACCGCTGC TGTGGTTCTT TGGTATTCAT GGTTCGCTGG CGCTGACCGC GCTGGACAGC
GGCATCATGA CGCCCTGGGC GCTGGAAAAC ATCTCTATTT ACCAGCAGTA TGGCTCCGTC
GATGCGGCGC TGGAAGCCGG TAAAACGTTC CATATCTGGG CGAAACCGAT GCTGGATTCT
TATATCTTCC TCGGTGGTAG CGGCGCAACG CTGGGTCTGA TCATCGCTAT CTTCCTCGCA
TCTCGTCGCG CGGACTATCG CCAGGTGGCA AAACTGGCGC TGCCGTCAGG TATCTTCCAG
ATTAACGAAC CCATCCTGTT TGGTCTGCCG ATTATTATGA ACCCGGTGAT GTTTATCCCC
TTTATTCTGG TACAACCGAT TCTGGCGGCG ATTACGCTGG TGGCTTACTA TTTGGGTATT
ATTCCGCCGA TTACCAATAT TGCGCCGTGG ACCATGCCAA CCGGGTTGGG GGCGTTCTTT
AACACCAACG GCAGTGTCGC CGCGTTGCTG GTTGCGCTAT TTAACCTGGC GGTCGCTACC
CTGATTTATC TCCCCTTCGT GGTGGTGGCT AACAAAGCGC AGAACGCCAT CGAGCAGGAA
GAAAGCGAAG AAGATATCGC TAACGCACTG AAATTCTAA
 
Protein sequence
MSNVIASLEK VLLPFAVKIG KQPHVNAIKN GFIRLMPLTL AGAMFVLINN VFLSFGEGSF 
FYSMGIRLDA STIETLNGLK GIGGNVYNGT LGIMSLMAPF FIGMALAEER KVDALAAGLL
SVAAFMTVTP YSVGEAYAVG ANWLGGANII SGIIIGLVVA EMFTFIVRRN WVIKLPDSVP
ASVSRSFSAL IPGFIILSIM GIIAWALSNY GSNFHQIIMD TISTPLASLG SVVGWAYVIF
VPLLWFFGIH GSLALTALDS GIMTPWALEN ISIYQQYGSV DAALEAGKTF HIWAKPMLDS
YIFLGGSGAT LGLIIAIFLA SRRADYRQVA KLALPSGIFQ INEPILFGLP IIMNPVMFIP
FILVQPILAA ITLVAYYLGI IPPITNIAPW TMPTGLGAFF NTNGSVAALL VALFNLAVAT
LIYLPFVVVA NKAQNAIEQE ESEEDIANAL KF