Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C3991 |
Symbol | |
ID | 6490946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 3868730 |
End bp | 3869728 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642744092 |
Product | 2,3-diketo-L-gulonate reductase |
Protein accession | YP_002047697 |
Protein GI | 194448452 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.242343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 88 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTAA CTTTCGAAGA GTTAAAAGGG GCCTTCTACC GCGTCTTGCG GTCGCGGAAT ATTGCGGAAG ATACCGCCGA CGCCTGCGCG GAAATGTTCG CTCGCACCAC CGAGTCCGGT GTCTATTCCC ACGGCGTGAA CCGCTTTCCC CGCTTCATTC AGCAACTGGA TAACGGCGAC ATTATTCCTG ATGCTAAACC GCAGCGGGTT ACCAGCCTCG GCGCCATCGA ACAGTGGGAT GCTCAGCGCG CTATCGGTAA CCTGACGGCG AAAAAGATGA TGGACCGGGC CATCGAGCTG GCTTCCGATC ATGGTATTGG CCTGGTGGCG TTACGTAATG CTAACCACTG GATGCGCGGC GGCAGCTACG GCTGGCAGGC GGCGGAAAGA GGCTATATCG GCATTTGCTG GACCAACTCC ATCGCCGTCA TGCCGCCGTG GGGCGCGAAA GAGTGCCGTA TCGGTACCAA TCCGCTGATC GTCGCCATCC CGTCTACGCC GATCACGATG GTAGATATGT CGATGTCGAT GTTCTCCTAC GGGATGTTAG AAGTTAACCG TCTGGCGGGC CGCGAACTGC CGGTGGACGG CGGTTTCGAC GATAACGGTC AGTTGACCAA AGAACCGGGC GTGATCGAGA AAAATCGCCG CATTTTACCA ATGGGTTACT GGAAAGGATC TGGTCTGTCG ATTGTGCTGG ACATGATTGC CACCCTGCTT TCCAACGGCT CTTCCGTTGC CGAAGTGACA CAGGAAAACA GCGATGAATA TGGCGTCTCG CAGATCTTCA TCGCCATAGA AGTGGATAAG CTGATCGATG GCGCAACCCG CGATGCCAAA CTGCAGCGGA TTATGGATTT CATCACCACT GCTGAACGCG CCGACGACAA CGTCGCGATT CGGCTGCCCG GCCACGAATT TACCAAATTG CTGGATGACA ACCGCCGTCA CGGTATCACC ATTGACGACA GCGTCTGGGC CAAAATTCAG GCGCTGTAA
|
Protein sequence | MKVTFEELKG AFYRVLRSRN IAEDTADACA EMFARTTESG VYSHGVNRFP RFIQQLDNGD IIPDAKPQRV TSLGAIEQWD AQRAIGNLTA KKMMDRAIEL ASDHGIGLVA LRNANHWMRG GSYGWQAAER GYIGICWTNS IAVMPPWGAK ECRIGTNPLI VAIPSTPITM VDMSMSMFSY GMLEVNRLAG RELPVDGGFD DNGQLTKEPG VIEKNRRILP MGYWKGSGLS IVLDMIATLL SNGSSVAEVT QENSDEYGVS QIFIAIEVDK LIDGATRDAK LQRIMDFITT AERADDNVAI RLPGHEFTKL LDDNRRHGIT IDDSVWAKIQ AL
|
| |