Gene SeD_A2431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2431 
SymbolrfbG 
ID6871427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2303918 
End bp2304997 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content44% 
IMG OID642785521 
ProductCDP-glucose 4,6-dehydratase 
Protein accessionYP_002216179 
Protein GI198242841 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR02622] CDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.310419 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0000678484 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTGATA AAAATTTTTG GCAAGGTAAA CGTGTATTCG TTACCGGCCA TACTGGCTTT 
AAAGGAAGCT GGCTTTCGCT ATGGCTGACT GAAATGGGTG CAATTGTAAA AGGCTATGCA
CTTGATGCGC CAACTGTTCC AAGTTTATTT GAGATAGTGC GTCTTAGTGA TCTTATGGCA
TCTCATATTG GCGACATTCG TGATTTTGAA AAGCTGCGCA ATTCTATTGC AGAATTTAAG
CCAGAAATTG TTTTCCATAT GGCAGCCCAG CCTTTAGTGC GCCTATCTTA TGAACAGCCA
ATCGAAACAT ACTCAACAAA TGTTATGGGT ACTGTCCATT TGCTTGAAGC AGTTAAGCAA
GTAGGTAACA TAAAGGCAGT CGTAAATATC ACCAGTGATA AGTGCTACGA CAATCGTGAG
TGGGTGTGGG GCTATCGTGA GAACGAACCC ATGGGAGGGT ACGATCCATA CTCTAATAGT
AAAGGTTGTG CAGAATTAGT CGCGTCTGCA TTCCGGAACT CATTCTTCAA TCCTGCAAAT
TATGAGCAAC ATGGCGTTGG TTTGGCGTCT GTGAGGGCTG GTAATGTCAT AGGCGGAGGC
GATTGGGCTA AAGACCGTTT AATTCCCGAT ATTCTGCGCT CATTTGAAAA TAACCAGCAG
GTTATTATTC GAAACCCATA TTCTATCCGT CCCTGGCAGC ATGTACTGGA GCCTCTTTCT
GGTTACATTG TGGTGGCGCA ACGCTTATAT ACAGAAGGTG CTAAGTTTTC TGAAGGATGG
AATTTCGGCC CGCGTGATGA AGATGCGAAG ACGGTCGAAT TTATTGTTGA CAAGATGGTC
ACGCTTTGGG GTGATGATGC AAGCTGGTTA CTGGATGGTG AGAATCATCC TCATGAGGCA
CATTACCTGA AACTGGATTG CTCTAAAGCA AATATGCAAT TAGGATGGCA TCCGCGTTGG
GGATTGACTG AAACACTTGG TCGCATCGTA AAATGGCATA AAGCATGGAT TCGCGGCGAA
GATATGTTGA TTTGTTCAAA GCGTGAAATC AGCGACTATA TGTCTGCAAC TACTCGTTAA
 
Protein sequence
MIDKNFWQGK RVFVTGHTGF KGSWLSLWLT EMGAIVKGYA LDAPTVPSLF EIVRLSDLMA 
SHIGDIRDFE KLRNSIAEFK PEIVFHMAAQ PLVRLSYEQP IETYSTNVMG TVHLLEAVKQ
VGNIKAVVNI TSDKCYDNRE WVWGYRENEP MGGYDPYSNS KGCAELVASA FRNSFFNPAN
YEQHGVGLAS VRAGNVIGGG DWAKDRLIPD ILRSFENNQQ VIIRNPYSIR PWQHVLEPLS
GYIVVAQRLY TEGAKFSEGW NFGPRDEDAK TVEFIVDKMV TLWGDDASWL LDGENHPHEA
HYLKLDCSKA NMQLGWHPRW GLTETLGRIV KWHKAWIRGE DMLICSKREI SDYMSATTR