Gene SeHA_C3991 details


Gene Information

Locus tag: SeHA_C3991
Symbol:
ID: 6490946
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3868730
End bp: 3869728
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 56%
IMG OID: 642744092
Product: 2,3-diketo-L-gulonate reductase
Protein accession: YP_002047697
Protein GI: 194448452
COG category: [C] Energy production and conversion
COG ID: [COG2055] Malate/L-lactate dehydrogenases
TIGRFAM ID:
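
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the Start bp and End bp values are 1-based inclusive positions and that the Protein Length excludes the stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3868730, 3869728)
print(length)                     # 999
print(protein_length_aa(length))  # 332
```

Both results agree with the Gene Length (999 bp) and Protein Length (332 aa) fields.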


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.242343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 88
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTAA CTTTCGAAGA GTTAAAAGGG GCCTTCTACC GCGTCTTGCG GTCGCGGAAT 
ATTGCGGAAG ATACCGCCGA CGCCTGCGCG GAAATGTTCG CTCGCACCAC CGAGTCCGGT
GTCTATTCCC ACGGCGTGAA CCGCTTTCCC CGCTTCATTC AGCAACTGGA TAACGGCGAC
ATTATTCCTG ATGCTAAACC GCAGCGGGTT ACCAGCCTCG GCGCCATCGA ACAGTGGGAT
GCTCAGCGCG CTATCGGTAA CCTGACGGCG AAAAAGATGA TGGACCGGGC CATCGAGCTG
GCTTCCGATC ATGGTATTGG CCTGGTGGCG TTACGTAATG CTAACCACTG GATGCGCGGC
GGCAGCTACG GCTGGCAGGC GGCGGAAAGA GGCTATATCG GCATTTGCTG GACCAACTCC
ATCGCCGTCA TGCCGCCGTG GGGCGCGAAA GAGTGCCGTA TCGGTACCAA TCCGCTGATC
GTCGCCATCC CGTCTACGCC GATCACGATG GTAGATATGT CGATGTCGAT GTTCTCCTAC
GGGATGTTAG AAGTTAACCG TCTGGCGGGC CGCGAACTGC CGGTGGACGG CGGTTTCGAC
GATAACGGTC AGTTGACCAA AGAACCGGGC GTGATCGAGA AAAATCGCCG CATTTTACCA
ATGGGTTACT GGAAAGGATC TGGTCTGTCG ATTGTGCTGG ACATGATTGC CACCCTGCTT
TCCAACGGCT CTTCCGTTGC CGAAGTGACA CAGGAAAACA GCGATGAATA TGGCGTCTCG
CAGATCTTCA TCGCCATAGA AGTGGATAAG CTGATCGATG GCGCAACCCG CGATGCCAAA
CTGCAGCGGA TTATGGATTT CATCACCACT GCTGAACGCG CCGACGACAA CGTCGCGATT
CGGCTGCCCG GCCACGAATT TACCAAATTG CTGGATGACA ACCGCCGTCA CGGTATCACC
ATTGACGACA GCGTCTGGGC CAAAATTCAG GCGCTGTAA
 
Protein sequence
MKVTFEELKG AFYRVLRSRN IAEDTADACA EMFARTTESG VYSHGVNRFP RFIQQLDNGD 
IIPDAKPQRV TSLGAIEQWD AQRAIGNLTA KKMMDRAIEL ASDHGIGLVA LRNANHWMRG
GSYGWQAAER GYIGICWTNS IAVMPPWGAK ECRIGTNPLI VAIPSTPITM VDMSMSMFSY
GMLEVNRLAG RELPVDGGFD DNGQLTKEPG VIEKNRRILP MGYWKGSGLS IVLDMIATLL
SNGSSVAEVT QENSDEYGVS QIFIAIEVDK LIDGATRDAK LQRIMDFITT AERADDNVAI
RLPGHEFTKL LDDNRRHGIT IDDSVWAKIQ AL
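
The relationship between the gene sequence and the protein sequence can be checked with a short script. The sketch below is illustrative only: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is typed in by hand for translation table 11 (whose amino-acid assignments match the standard code), and it does not handle alternative start codons or ambiguous bases.

```python
# Genetic code for translation table 11, listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGAAAGTAA CTTTCGAAGA GTTAAAAGGG"
print(translate(first_block))  # MKVTFEELKG
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_fraction` on the same input can be compared with the GC content field.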