Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C4773 |
Symbol | |
ID | 6491602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 4652307 |
End bp | 4653851 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642744826 |
Product | hypothetical protein |
Protein accession | YP_002048399 |
Protein GI | 194448194 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 0.0676995 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCATA ACATGAAGAA AAACCCTGTA AGTATACCAC ACTCCATTTG GCCCGCCGAT GACATCAAAC GGCTGGAACG CGATGCGGCG GATGCCTTCG GACTCACACT CTATGAATTG ATGCTGCGCG CTGGCGACGC GGCATTTCGG GTAGCCCGTG ACAGTTATCC TGACACCCGA CACTGGCTGG TGTTGTGTGG TCATGGCAAC AACGGCGGCG ATGGTTACGT CGTGGCGCGA CTAGCGCAAG CGGCGGGCAT TAGCGTAACG TTGCTGGCGC AGGAGAGCGA CAAACCGTTA CCTGAAGAAG CGGCGCAGGC GCGCGATGCC TGGCTGAATG CTGGCGGCAT TATCCATGCT GCCGATATTA TCTGGCCGGA AGCGACGGAT CTGATTATCG ACGCGCTGCT TGGCACCGGC ATAGCCCAGG CGCCGCGCGA CCCGGTAGCC GGTCTGATTG AACAGGCGAA CGCTCATCCT GCGCCGGTTG TCGCCGTCGA TATCCCGTCA GGTCTGCTGG CGCAAACGGG CGCCACGCCT GGCGCGGTGA TAAGCGCCGC GCATACGGTC ACGTTTATCG CCCTGAAACC AGGCCTGCTG ACCGGCAAAG CGCGTGACGT TACCGGCATA TTGCATTATG ACGCGTTGGG ACTGGAAGGC TGGCTGGCGA GCCAGACGCC GCCGCTCCGG CGTTTTGACG CGACGCAGTT GGGGCAATGG CTAACGCCGC GTCGACCGAC CTCGCATAAG GGCGATCATG GTCGTCTGGC GATTATTGGA GGCGACCAGG GAACAGCGGG CGCAATTCGG ATGGCTGGCG AGGCGGCGCT GCGTACGGGA GCTGGCTTGG TCAGAGTACT GACCCGCGGT GAAAACATCG CGCCGTTGCT GACGGCCCGC CCGGAACTGA TGGTACATGA ACTCACGCCT CAGTCGCTGG AAGAGAGCCT GACCTGGGCT GACGTTGTGG TGATCGGCCC GGGGCTTGGG CAGCAGGAAT GGGGCAAAAA AGCCTTACAG AAAGTAGAAA ACGTCCGTAA ACCTATGCTG TGGGACGCGG ATGCGTTGAA CCTACTGGCA ATCAATCCTG ATAAACGTCA CAATCGCGTG ATTACGCCGC ATCCGGGAGA AGCTGCCCGC CTGTTAGGAT GTTCTGTGGC AGAAATTGAA AGTGATCGCT TACTTTCAGC ACAGCGTCTG GTAAAACGGT ACGGAGGCGT GGTCGTGTTA AAAGGCGCAG GAACGATTAT CGCCGCTGAA CACCACCCTC TGGCTATCAT TGACGCTGGT AATGCGGGGA TGGCGAGCGG CGGGATGGGC GATGTCCTGT CCGGCATCAT CGGCGCATTG CTCGGACAGA AGTTTACCCC GTATGATGCG GCATGTGTGG GATGTGTGGC TCACGGCGCG GCGGCGGACT TACTGGCAGC GCGTTATGGC GCTCGCGGCA TGTTGGCGAC CGATCTTTTT ACTACGCTGC GGCGTATTGT TAACCCTGAT GTGATTGACG TAAACCATGA TGAATCGAGT AATTCCGCTA CCTGA
|
Protein sequence | MDHNMKKNPV SIPHSIWPAD DIKRLERDAA DAFGLTLYEL MLRAGDAAFR VARDSYPDTR HWLVLCGHGN NGGDGYVVAR LAQAAGISVT LLAQESDKPL PEEAAQARDA WLNAGGIIHA ADIIWPEATD LIIDALLGTG IAQAPRDPVA GLIEQANAHP APVVAVDIPS GLLAQTGATP GAVISAAHTV TFIALKPGLL TGKARDVTGI LHYDALGLEG WLASQTPPLR RFDATQLGQW LTPRRPTSHK GDHGRLAIIG GDQGTAGAIR MAGEAALRTG AGLVRVLTRG ENIAPLLTAR PELMVHELTP QSLEESLTWA DVVVIGPGLG QQEWGKKALQ KVENVRKPML WDADALNLLA INPDKRHNRV ITPHPGEAAR LLGCSVAEIE SDRLLSAQRL VKRYGGVVVL KGAGTIIAAE HHPLAIIDAG NAGMASGGMG DVLSGIIGAL LGQKFTPYDA ACVGCVAHGA AADLLAARYG ARGMLATDLF TTLRRIVNPD VIDVNHDESS NSAT
|
| |