Gene SeHA_C2317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2317 
SymbolrfbG 
ID6491120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2219885 
End bp2220964 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content44% 
IMG OID642742506 
ProductCDP-glucose 4,6-dehydratase 
Protein accessionYP_002046141 
Protein GI194447516 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR02622] CDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.0156634 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGATA AAAATTTTTG GCAAGGTAAA CGTGTATTCG TTACCGGCCA TACTGGCTTT 
AAAGGAAGCT GGCTTTCGCT ATGGCTGACT GAAATGGGTG CAATTGTAAA AGGCTATGCA
CTTGATGCGC CAACTGTTCC AAGTTTATTT GAGATAGTGC GTCTTAATGA TCTTATGGAA
TCTCATATTG GCGACATTCG TGATTTTGAA AAGCTGCGCA ATTCTATTGC AGAATTTAAG
CCAGAAATTG TTTTCCATAT GGCAGCCCAG CCTTTAGTGC GCCTATCTTA TGAACAGCCA
ATCGAAACAT ACTCAACAAA TGTTATGGGT ACTGTCCATT TGCTTGAAAC AGTTAAGCAA
GTAGGTAACA TAAAGGCAGT CGTAAATATC ACCAGTGATA AGTGCTACGA CAATCGTGAG
TGGGTGTGGG GCTATCGTGA GAACGAACCC ATGGGAGGGT ACGATCCATA CTCTAATAGT
AAAGGTTGTG CAGAATTAGT CGCGTCTGCA TTCCGGAACT CATTCTTCAA TCCTGCAAAT
TATGAGCAAC ATGGCGTTGG TTTGGCGTCT GTGAGGGCTG GTAATGTCAT AGGCGGAGGC
GATTGGGCTA AAGACCGTTT AATTCCCGAT ATTCTGCGCT CATTTGAAAA TAACCAGCAG
GTTATTATTC GAAACCCATA TTCTATCCGT CCCTGGCAGC ATGTACTGGA GCCTCTTTCT
GGTTACATTG TGGTGGCGCA ACGCTTATAT ACAGAAGGTG CTAAGTTTTC TGAAGGATGG
AATTTCGGCC CGCGTGATGA AGATGCGAAG ACGGTCGAAT TTATTGTTGA CAAGATGGTC
ACGCTTTGGG GTGATGATGC AAGCTGGTTA CTGGATGGTG AGAATCATCC TCATGAGGCA
CATTACCTGA AACTGGATTG CTCTAAAGCA AATATGCAAT TAGGATGGCA TCCGCGTTGG
GGATTGACTG AAACACTTGG TCGCATCGTA AAATGGCATA AAGCATGGAT TCGCGGCGAA
GATATGTTGA TTTGTTCAAA GCGTGAAATC AGCGACTATA TGTCTGCAAC TACTCGTTAA
 
Protein sequence
MIDKNFWQGK RVFVTGHTGF KGSWLSLWLT EMGAIVKGYA LDAPTVPSLF EIVRLNDLME 
SHIGDIRDFE KLRNSIAEFK PEIVFHMAAQ PLVRLSYEQP IETYSTNVMG TVHLLETVKQ
VGNIKAVVNI TSDKCYDNRE WVWGYRENEP MGGYDPYSNS KGCAELVASA FRNSFFNPAN
YEQHGVGLAS VRAGNVIGGG DWAKDRLIPD ILRSFENNQQ VIIRNPYSIR PWQHVLEPLS
GYIVVAQRLY TEGAKFSEGW NFGPRDEDAK TVEFIVDKMV TLWGDDASWL LDGENHPHEA
HYLKLDCSKA NMQLGWHPRW GLTETLGRIV KWHKAWIRGE DMLICSKREI SDYMSATTR