Gene SbBS512_E0680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0680 
SymbolgalE 
ID6271649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp645324 
End bp646340 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content54% 
IMG OID641724876 
ProductUDP-galactose-4-epimerase 
Protein accessionYP_001879409 
Protein GI187731338 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGTTC TGGTTACCGG TGGTAGCGGT TACATTGGAA GTCATACCTG CGTGCAATTA 
CTGCAAAACG GTCATGATGT CATCATTCTT GATAACCTCT GTAACAGTAA GCGCAGCGTA
CTGCCTGTTA TCGAGCGTTT AGGCGGCAAA CATCCAACGT TTGTTGAAGG CGATATTCGT
AACGAAGCGT TGATGACCGA GATCCTGCAC GATCACGCTA TCGACACCGT GATCCACTTC
GCCGGGCTGA AAGCCGTTGG CGAATCGGTA CAAAAACCGC TGGAATATTA CGACAACAAT
GTCAACGGTA CTCTGCGCCT GATTAGCGCC ATGCGCGCCG CTAACGTCAA AAACTTTATT
TTTAGCTCCT CCGCCACCGT TTATGGCGAT CAGCCCAAAA TTCCATACGT TGAAAGCTTC
CCGACCGGCA CACCGCAAAG CCCTTACGGC AAAAGCAAGC TGATGGTGGA ACAGATCCTC
ACCGATCTGC AAAAAGCCCA GCCGGACTGG AGCATTGCCC TGCTGCGCTA CTTCAACCCG
GTTGGCGCGC ATCCGTCGGG CGATATGGGC GAAGATCCGC AAGGCATTCC GAATAACCTG
ATGCCATACA TCGCCCAGGT TGCTGTAGGC CGTCGCGACT CGCTGGCGAT TTTTGGTAAC
GATTATCCGA CCGAAGATGG TACTGGCGTA CGCGATTACA TCCACGTAAT GGATCTGGCG
GACGGTCACG TCGTGGCGAT GGAAAAACTG GCGAACAAGC CAGGCGTACA CATCTACAAC
CTCGGCGCTG GCATAGGCAG CAGCGTGCTG GACGTGGTTA ATGCCTTCAG CAAAGCCTGC
GGCAAACCGG TTAACTATCA TTTTGCACCG CGTCGCGAGG GCGACCTTCC GGCCTACTGG
GCGGACGCCA GCAAAGCCGA CCGTGAACTG AACTGGCGCG TAACGCGCAC ACTCGATGAA
ATGGCGCAGG ACACCTGGCA CTGGCAGTCA CGCCATCCAC AGGGATATCC CGATTAA
 
Protein sequence
MRVLVTGGSG YIGSHTCVQL LQNGHDVIIL DNLCNSKRSV LPVIERLGGK HPTFVEGDIR 
NEALMTEILH DHAIDTVIHF AGLKAVGESV QKPLEYYDNN VNGTLRLISA MRAANVKNFI
FSSSATVYGD QPKIPYVESF PTGTPQSPYG KSKLMVEQIL TDLQKAQPDW SIALLRYFNP
VGAHPSGDMG EDPQGIPNNL MPYIAQVAVG RRDSLAIFGN DYPTEDGTGV RDYIHVMDLA
DGHVVAMEKL ANKPGVHIYN LGAGIGSSVL DVVNAFSKAC GKPVNYHFAP RREGDLPAYW
ADASKADREL NWRVTRTLDE MAQDTWHWQS RHPQGYPD