Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C3047 |
Symbol | hypE |
ID | 6489186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 2976987 |
End bp | 2977997 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642743202 |
Product | hydrogenase expression/formation protein HypE |
Protein accession | YP_002046821 |
Protein GI | 194449871 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0309] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR02124] hydrogenase expression/formation protein HypE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.552535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.00562137 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAACAATA TACAACTCGC CCACGGCAGC GGCGGCCAGG CTATGCAACA GTTGATTAAT AGCCTGTTTA TGGAGGCCTT TGCCAACCCG TGGCTGGCGG AACAAGAAGA TCAGGCGCGT CTGGAGCTGG CGCAGCTGAC GGCGGAGGGC GACCGTCTGG CGTTTTCTAC CGATAGCTAT GTGATCGATC CGCTCTTTTT TCCCGGCGGC AATATCGGCA AACTGGCGAT TTGCGGTACG GCGAACGACG TAGCGGTGAG CGGCGCGATC CCCCGCTATC TCTCCTGCGG CTTTATCCTT GAAGAAGGGC TGCCGATGGA TACGTTAAAA AGCGTGGTCA ATAGCATGGC GGCGACCGCG CGGGAAGCGG GTATCGCCAT TGTAACCGGC GACACCAAAG TTGTGCAGCG CGGCGCGGCG GACAAATTAT TTATCAACAC TGCCGGCATG GGGGCGATTC CCGCCGATAT TCACTGGGGC GCGCAAACGC TGAGCGTTGG CGATGTGCTG TTAGTCAGCG GTACGCTTGG CGATCACGGT GCCACTATTC TCAACCTGCG TGAGCAACTG GGACTGGATG GCGAGCTGGC GAGCGACTGC GCAGTATTAA CGCCGCTTAT TCAGACGCTG CGTCATATTG ACGGAGTGAA GGCATTGCGT GACGCCACGC GCGGCGGCGT GAATGCGGTC GCCCATGAGT TTGCGGCGTC CTGCGGTTAC GGTATTGAAT TGTCCGAATC CGCACTGCCG CTCAAACCTG CCGTGCGCGG CGTCTGCGAG CTGTTGGGGC TGGATGCCCT GAACTTTGCC AACGAAGGAA AACTGGTGAT TGCCGTTGAA CGGCAGGCCG CAGATCGGGC GTTGGCGGCA TTACGCGCGC ATCCGCTGGG ACGTGATGCA GCGCTGATTG GCGAAGTCGT GGAACGCAAA GGCGTTCGCT TAGCCGGACT CTATGGCGTG AAGCGAACCC TTGATTTGCC ACACGCCGAA CCATTACCTC GTATATGCTA G
|
Protein sequence | MNNIQLAHGS GGQAMQQLIN SLFMEAFANP WLAEQEDQAR LELAQLTAEG DRLAFSTDSY VIDPLFFPGG NIGKLAICGT ANDVAVSGAI PRYLSCGFIL EEGLPMDTLK SVVNSMAATA REAGIAIVTG DTKVVQRGAA DKLFINTAGM GAIPADIHWG AQTLSVGDVL LVSGTLGDHG ATILNLREQL GLDGELASDC AVLTPLIQTL RHIDGVKALR DATRGGVNAV AHEFAASCGY GIELSESALP LKPAVRGVCE LLGLDALNFA NEGKLVIAVE RQAADRALAA LRAHPLGRDA ALIGEVVERK GVRLAGLYGV KRTLDLPHAE PLPRIC
|
| |