Gene SeHA_C1442 details


Gene Information

Locus tag: SeHA_C1442
Symbol: chbC
ID: 6488494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 1397255
End bp: 1398613
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 51%
IMG OID: 642741674
Product: PTS system N,N'-diacetylchitobiose-specific transporter subunit IIC
Protein accession: YP_002045321
Protein GI: 194448756
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1455] Phosphotransferase system cellobiose-specific component IIC
TIGRFAM ID: [TIGR00359] phosphotransferase system, cellobiose specific, IIC component
            [TIGR00410] PTS system, lactose/cellobiose family IIC component
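
The length fields in this record follow from its coordinates. The sketch below checks that arithmetic under two assumptions that the record does not state explicitly: Start bp and End bp are 1-based inclusive positions, and the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1397255, 1398613)
print(length)                     # 1359
print(protein_length_aa(length))  # 452
```

Both values agree with the Gene Length and Protein Length fields of the record.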


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 87
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAACG TTATTGCTTC ACTTGAAAAG GTACTCCTTC CTTTTGCTGT AAAAATAGGA 
AAGCAGCCAC ACGTTAATGC AATCAAGAAC GGCTTTATTC GTTTAATGCC ATTAACCCTT
GCGGGGGCGA TGTTTGTATT AATTAATAAC GTTTTCCTGA GCTTTGGGGA GGGGTCGTTT
TTTTATTCCA TGGGCATTCG GCTTGATGCC TCAACCATTG AAACACTTAA TGGATTGAAA
GGCATCGGCG GGAACGTCTA TAACGGCACA TTGGGCATTA TGTCGCTCAT GGCGCCATTT
TTTATCGGCA TGGCGCTGGC GGAAGAGCGC AAGGTTGATG CGCTGGCCGC CGGACTGCTC
TCCGTGGCGG CATTTATGAC CGTAACGCCG TATAGTGTCG GCGAGGCCTA TGCCGTAGGC
GCTAACTGGC TGGGCGGCGC GAATATTATC TCCGGTATTA TTATTGGCCT GGTGGTCGCG
GAGATGTTTA CATTTGTTGT CCGGCGCAAC TGGGTGATTA AACTACCGGA CAGCGTACCG
GCTTCGGTGT CTCGTTCATT CTCAGCATTA ATTCCCGGCT TTATTATTCT CTCTATTATG
GGGATTATTG CCTGGGCGCT GTCTAATTAC GGTTCTAACT TCCATCAGAT TATTATGGAC
ACTATCTCTA CGCCGCTGGC ATCGCTGGGT AGCGTGGTAG GGTGGGCATA TGTCATTTTT
GTACCGCTGC TGTGGTTCTT TGGTATTCAT GGTTCGCTGG CGCTGACCGC GCTGGACAGC
GGCATCATGA CGCCCTGGGC GCTGGAAAAC ATCTCTATTT ACCAGCAGTA TGGCTCCGTC
GATGCGGCGC TGGAAGCCGG TAAAACGTTC CATATCTGGG CGAAACCGAT GCTGGATTCT
TATATCTTCC TCGGTGGTAG CGGCGCAACG CTGGGTCTGA TCATCGCTAT CTTCCTGGCA
TCCCGTCGCG CGGACTATCG CCAGGTGGCA AAACTGGCGC TGCCGTCAGG TATCTTCCAG
ATTAACGAAC CCATCCTGTT TGGCCTGCCG ATTATTATGA ACCCGGTGAT GTTTATCCCC
TTCATTCTGG TACAGCCGAT TCTGGCGGCG ATTACGCTGG TGGCTTACTA TTTGGGTATT
ATTCCGCCGA TTACCAATAT TGCGCCGTGG ACCATGCCAA CCGGGCTGGG GGCGTTCTTT
AACACCAACG GCAGCGTTGC CGCATTACTG GTTGCGCTAT TTAACCTGGC GGTCGCTACC
CTGATTTATC TCCCCTTCGT GGTGGTGGCT AACAAAGCGC AGAACGCCAT TGAGCAGGAA
GAAAGCGAAG AAGATATCGC TAACGCACTG AAATTCTAA
 
Protein sequence
MSNVIASLEK VLLPFAVKIG KQPHVNAIKN GFIRLMPLTL AGAMFVLINN VFLSFGEGSF 
FYSMGIRLDA STIETLNGLK GIGGNVYNGT LGIMSLMAPF FIGMALAEER KVDALAAGLL
SVAAFMTVTP YSVGEAYAVG ANWLGGANII SGIIIGLVVA EMFTFVVRRN WVIKLPDSVP
ASVSRSFSAL IPGFIILSIM GIIAWALSNY GSNFHQIIMD TISTPLASLG SVVGWAYVIF
VPLLWFFGIH GSLALTALDS GIMTPWALEN ISIYQQYGSV DAALEAGKTF HIWAKPMLDS
YIFLGGSGAT LGLIIAIFLA SRRADYRQVA KLALPSGIFQ INEPILFGLP IIMNPVMFIP
FILVQPILAA ITLVAYYLGI IPPITNIAPW TMPTGLGAFF NTNGSVAALL VALFNLAVAT
LIYLPFVVVA NKAQNAIEQE ESEEDIANAL KF
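
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to generate this record. It assumes the standard amino-acid assignments (translation table 11 uses the same assignments and differs in which codons may serve as start codons), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Amino-acid assignments in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 39 bases of the gene sequence in this record.
print(translate("ATGAGTAACG TTATTGCTTC ACTTGAAAAG"))             # MSNVIASLEK
print(translate("GAAAGCGAAG AAGATATCGC TAACGCACTG AAATTCTAA"))  # ESEEDIANALKF
```

The two outputs match the start and the end of the protein sequence listed in this record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.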