Gene SNSL254_A2273 details


Gene Information

Locus tag: SNSL254_A2273
Symbol: rfbG
ID: 6485624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2183705
End bp: 2184784
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 43%
IMG OID: 642737620
Product: CDP-glucose 4,6-dehydratase
Protein accession: YP_002041362
Protein GI: 194445789
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02622] CDP-glucose 4,6-dehydratase
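
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon (both assumptions are consistent with the values in this record, but the record does not state them). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 2183705, 2184784

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1080
print(protein_length_aa(length))  # 359
```

Both results agree with the Gene Length (1080 bp) and Protein Length (359 aa) fields.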


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0000124958
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATTGATA AAAATTTTTG GCAAGGTAAA CGTGTATTCG TTACCGGCCA TACTGGCTTT 
AAAGGAAGCT GGCTTTCGCT ATGGCTGACT GAAATGGGTG CAATTGTAAA AGGCTATGCA
CTTGATGCGC CAACTGTTCC AAGTTTATTT GAGATAGTGC ATCTTAATGA TCTTATGGAA
TCTCATATTG GCGATATTCG TGATTTTGAA AAGCTGCGCA ATTCTATTGC AGAATTTAAG
CCAGAAATTG TTTTCCATAT GGCAGCCCAG CCTTTAGTGC GCCTATCTTA TGAACAGCCA
ATCGAAACAT ACTCAACAAA TGTTATGGGT ACTGTCCATT TGCTTGAAGC AGTTAAGCAA
GTAGGTAACA TAAAGGCAGT CGTAAATATC ACCAGTGATA AGTGCTACGA CAATCGTGAG
TGGGTGTGGG GCTATCGTGA GAACGAACCC ATGGGAGGGT ACGATCCATA CTCTAATAGT
AAAGGTTGTG CAGAATTAGT CGCGTCTGCA TTCCGGAACT CATTCTTCAA TCCTGCAAAT
TATGAGCAAC ATGGCGTTGG TTTGGCGTCT GTGAGGGCTG GTAATGTCAT AGGCGGAGGC
GATTGGGCTA AAGACCGTTT AATTCCCGAT ATTCTGCGCT CATTTGAAAA TAACCAGCAG
GTTATTATTC GAAACCCATA TTCTATCCGT CCATGGCAGC ATGTACTGGA GCCTCTTTCT
GGTTACATTG TGGTGGCGCA ACGCTTATAT ACAGAAGGTG CTAAGTTTTC TGAAGGATGG
AATTTCGGCC CGCGTGATGA AGATGCGAAG ACGGTCGAAT TTATTGTTGA CAAGATGGTC
ACGCTTTGGG GTGATGATGC AAGCTGGTTA CTGGATGGTG AGAATCATCC TCATGAGGCA
CATTATCTGA AACTGGATTG CTCTAAAGCA AATATGCAAT TAGGATGGCA TCCGCGTTGG
GGATTGACTG AAACACTTGG TCGCATCGTA AAATGGCATA AAGCATGGAT TCGCGGCGAA
GATATGTTGA TTTGTTCAAA GCGTGAAATC AGCGACTATA TGTCTGCAAC TACTCGTTAA
 
Protein sequence
MIDKNFWQGK RVFVTGHTGF KGSWLSLWLT EMGAIVKGYA LDAPTVPSLF EIVHLNDLME 
SHIGDIRDFE KLRNSIAEFK PEIVFHMAAQ PLVRLSYEQP IETYSTNVMG TVHLLEAVKQ
VGNIKAVVNI TSDKCYDNRE WVWGYRENEP MGGYDPYSNS KGCAELVASA FRNSFFNPAN
YEQHGVGLAS VRAGNVIGGG DWAKDRLIPD ILRSFENNQQ VIIRNPYSIR PWQHVLEPLS
GYIVVAQRLY TEGAKFSEGW NFGPRDEDAK TVEFIVDKMV TLWGDDASWL LDGENHPHEA
HYLKLDCSKA NMQLGWHPRW GLTETLGRIV KWHKAWIRGE DMLICSKREI SDYMSATTR