Gene SNSL254_A4202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4202 
SymbolrfbB2 
ID6483175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4097989 
End bp4099056 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content56% 
IMG OID642739456 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_002043159 
Protein GI194442418 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones94 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGCA TTCTGGTGAC CGGCGGCGCA GGCTTTATCG GATCTGCGGT GGTACGGCAT 
ATCATCCATG AAACGGCAGA CGCGGTGGTG GTGGTGGATA AACTCACCTA TGCGGGCAAC
CTGATGTCTC TGGCATCGGT GACGCAAAGC GACCGTTTCG CCTTTGAGAA GGTGGATATC
TGCGATCGGG CATCACTGGA GCGAGTCTTC CAGCAGTATC ATCCCAATAG CGTGATGCAC
CTGGCGGCGG AAAGCCATGT AGACCGTTCC ATCGACGGCC CGGCGGCGTT TATTGAAACG
AATATAGTTG GTACTTACAC CTTGCTGGAA GCCGCTCGCG CTTACTGGTC CGCGCTTGAC
GCGGACGCTA AAGCGGCGTT CCGCTTCCAC CATATCTCCA CCGATGAAGT GTATGGCGAT
CTGCATACTG CAGACGATTT CTTCACCGAA ACCACGCCAT ATGCGCCAAG CAGCCCTTAT
TCCGCCTCCA AAGCCAGCAG CGACCATCTG GTACGCGCCT GGTTACGTAC CTACGGTCTG
CCTACGCTTG TCACCAACTG CTCTAATAAC TACGGGCCCT ACCATTTCCC GGAAAAACTG
ATCCCGCTGA TGATTCTGAA CGCGCTGGCG GGTAAACCAT TGCCGGTCTA TGGCAACGGT
CAGCAAATTC GCGACTGGCT GTATGTAGAG GATCATGCCC GTGCGTTGTA TCACGTGGTG
ACGAACGGTG CGGTGGGCGA AACGTATAAT ATCGGCGGTC ATAACGAACG TAAAAATCTG
GATGTGGTCA GGACGATCTG CGCTCTGTTG GAGGAACTGG CCCCGCAAAA ACCGCAGGGT
GTGGCGAATT ATCACGATCT GATTACCTTC GTCGACGATC GCCCCGGCCA TGACTTACGC
TATGCCATCG ACGCGTCGAA AATCGCCCGC GAGCTGGGCT GGACGCCGCA GGAAACCTTC
GAAAGCGGGA TGCGAAAAAC CGTTCAGTGG TATCTCGCCA ATGAGGCCTG GTGGAAACCC
GTGCAGGATG GCAGTTATCA GGGCGAACGC TTAGGGCTGA AACGCTAA
 
Protein sequence
MKRILVTGGA GFIGSAVVRH IIHETADAVV VVDKLTYAGN LMSLASVTQS DRFAFEKVDI 
CDRASLERVF QQYHPNSVMH LAAESHVDRS IDGPAAFIET NIVGTYTLLE AARAYWSALD
ADAKAAFRFH HISTDEVYGD LHTADDFFTE TTPYAPSSPY SASKASSDHL VRAWLRTYGL
PTLVTNCSNN YGPYHFPEKL IPLMILNALA GKPLPVYGNG QQIRDWLYVE DHARALYHVV
TNGAVGETYN IGGHNERKNL DVVRTICALL EELAPQKPQG VANYHDLITF VDDRPGHDLR
YAIDASKIAR ELGWTPQETF ESGMRKTVQW YLANEAWWKP VQDGSYQGER LGLKR