Gene SeD_A4053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4053 
Symbol 
ID6875567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3895887 
End bp3896885 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content56% 
IMG OID642787002 
Product2,3-diketo-L-gulonate reductase 
Protein accessionYP_002217629 
Protein GI198244331 
COG category[C] Energy production and conversion 
COG ID[COG2055] Malate/L-lactate dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.771616 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value0.431121 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTAA CTTTCGAAGA GTTAAAAGGG GCCTTCTACC GCGTCTTGCG GTCGCGGAAT 
ATTGCGGAAG ATACCGCCGA CGCCTGCGCG GAAATGTTCG CTCGCACCAC CGAGTCCGGT
GTCTATTCCC ACGGCGTGAA CCGCTTTCCC CGCTTCATCC AGCAACTGGA TAACGGCGAC
ATTATTCCTG ATGCTAAACC GCAGCGAGTT ACCAGCCTCG GCGCCATCGA ACAGTGGGAT
GCTCAGCGCG CTATCGGCAA CCTGACGGCG AAAAAGATGA TGGACCGGGC CATCGAGCTG
GCTTCCGATC ATGGTATTGG CCTGGTGGCG TTACGTAATG CTAACCACTG GATGCGCGGC
GGCAGCTACG GCTGGCAGGC GGCGGAAAAA GGCTATATCG GCATTTGCTG GACCAACTCC
ATCGCCGTCA TGCCGCCGTG GGGGGCGAAA GAGTGCCGTA TCGGTACCAA TCCGCTGATC
GTCGCTATCC CTTCCACGCC GATCACGATG GTGGATATGT CGATGTCGAT GTTCTCCTAC
GGAATGTTAG AAGTTAACCG TCTGGCGGGC CGCGAACTGC CGGTGGATGG CGGTTTCGAC
GATAACGGTC AGTTGACCAA AGAACCGGGC GTTATCGAGA AAAATCGCCG CATTTTACCA
ATGGGTTACT GGAAAGGATC TGGTCTGTCG ATTGTGCTGG ACATGATTGC CACCCTGCTT
TCCAACGGCT CTTCCGTTGC CGAAGTGACC CAGGAAAACA GCGATGAGTA TGGCGTCTCA
CAGATTTTCA TCGCCATAGA AGTGGATAAG CTGATCGATG GCGCAACCCG CGATGCCAAA
CTGCAGCGGA TTATGGATTT CATCACCACT GCTGAACGCG CCGACGACAA CGTCGCGATT
CGGCTGCCCG GCCACGAATT TACCAAATTG CTGGATGACA ACCGCCGTCA CGGTATCACC
ATTGACGACA GCGTCTGGGC CAAAATTCAG GCGCTGTAA
 
Protein sequence
MKVTFEELKG AFYRVLRSRN IAEDTADACA EMFARTTESG VYSHGVNRFP RFIQQLDNGD 
IIPDAKPQRV TSLGAIEQWD AQRAIGNLTA KKMMDRAIEL ASDHGIGLVA LRNANHWMRG
GSYGWQAAEK GYIGICWTNS IAVMPPWGAK ECRIGTNPLI VAIPSTPITM VDMSMSMFSY
GMLEVNRLAG RELPVDGGFD DNGQLTKEPG VIEKNRRILP MGYWKGSGLS IVLDMIATLL
SNGSSVAEVT QENSDEYGVS QIFIAIEVDK LIDGATRDAK LQRIMDFITT AERADDNVAI
RLPGHEFTKL LDDNRRHGIT IDDSVWAKIQ AL