Gene SbBS512_E1146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1146 
SymbolgatC 
ID6273047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1037331 
End bp1038686 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content51% 
IMG OID641725275 
ProductPTS system, galactitol-specific IIC component 
Protein accessionYP_001879793 
Protein GI187733334 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3775] Phosphotransferase system, galactitol-specific IIC component 
TIGRFAM ID[TIGR00827] PTS system, galactitol-specific IIC component 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTCAG AAGTCATGCG TTATATTCTC GACCTCGGCC CTACGGTGAT GCTTCCGATT 
GTCATCATTA TTTTTTCTAA AATATTAGGC ATGAAGGCAG GCGATTGCTT TAAAGCGGGT
CTGCATATCG GGATTGGCTT TGTTGGCATT GGCCTTGTGA TTGGCTTAAT GCTGGATTCC
ATTGGTCCGG CGGCGAAAGC GATGGCGGAA AATTTCGACC TGAATCTGCA TGTGGTCGAT
GTCGGCTGGC CGGGCTCTTC ACCAATGACC TGGGCGTCGC AAATTGCGCT GGTGGCGATT
CCGATTGCGA TTCTGGTTAA CGTGGCGATG TTACTGACCC GTATGACGCG GGTGGTAAAT
ATTGATATCT GGAATATCTG GCATATGACC TTCACCGGCG CGTTGCTGCA TCTGGCAACC
GGTTCATGGA TGATAGGGAT GGCAGGTGTG GTAATTCACG CGGCGTTTGT TTATAAGCTC
GGCGACTGGT TTGCCCGCGA TACCCGAAAT TTCTTTGAGC TGGAAGGTAT TGCTATTCCG
CACGGTACGT CGGCGTATAT GGGGCCGATT GCGGTGCTGG TCGATGCTAT CATCGAGAAA
ATCCCAGGCG TTAACCGAAT TAAATTTAGC GCCGACGATA TTCAGCGCAA ATTTGGTCCA
TTTGGCGAGC CTGTCACCGT GGGTTTTGTG ATGGGGCTGA TTATCGGCAT CCTCGCGGGT
TACGATGTCA AAGGTGTATT GCAGCTGGCG GTAAAAACGG CGGCAGTGAT GCTGCTAATG
CCACGGGTGA TTAAACCCAT CATGGATGGT TTAACGCCCA TCGCTAAGCA GGCCCGTAGT
CGTTTACAGG CGAAGTTCGG CGGTCAGGAG TTCCTGATTG GCCTGGATCC CGCATTACTG
CTGGGGCATA CGGCGGTGGT ATCGGCAAGC CTGATTTTTA TCCCGCTCAC CATTTTAATT
GCTGTTTGTG TTCCGGGTAA TCAGGTGCTG CCGTTTGGCG ATCTTGCCAC CATCGGCTTC
TTTGTGGCGA TGGCAGTCGC CGTGCATCGT GGAAATCTGT TCCGCACCTT AATCTCGGGT
GTCATCATCA TGAGCATCAC TCTGTGGATC GCGACGCAAA CTATTGGTTT GCACACCCAA
CTGGCGGCTA ATGCTGGGGC GTTAAAAGCC GGGGGTATGG TGACTTCAAT GGATCAGGGC
GGTTCTCCCA TTACCTGGTT ACTGATTCAG GTTTTCTCCC CGCAAAATAT TCCCGGTTTC
ATTATTATCG GTGCAATTTA TCTGACCGGT ATTTTCATGA CCTGGCGTAG AGCGCGTGGC
TTTATTAAAC AAGAGAAAGT CGTTCTCGCA GAATAA
 
Protein sequence
MFSEVMRYIL DLGPTVMLPI VIIIFSKILG MKAGDCFKAG LHIGIGFVGI GLVIGLMLDS 
IGPAAKAMAE NFDLNLHVVD VGWPGSSPMT WASQIALVAI PIAILVNVAM LLTRMTRVVN
IDIWNIWHMT FTGALLHLAT GSWMIGMAGV VIHAAFVYKL GDWFARDTRN FFELEGIAIP
HGTSAYMGPI AVLVDAIIEK IPGVNRIKFS ADDIQRKFGP FGEPVTVGFV MGLIIGILAG
YDVKGVLQLA VKTAAVMLLM PRVIKPIMDG LTPIAKQARS RLQAKFGGQE FLIGLDPALL
LGHTAVVSAS LIFIPLTILI AVCVPGNQVL PFGDLATIGF FVAMAVAVHR GNLFRTLISG
VIIMSITLWI ATQTIGLHTQ LAANAGALKA GGMVTSMDQG GSPITWLLIQ VFSPQNIPGF
IIIGAIYLTG IFMTWRRARG FIKQEKVVLA E