Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A3168 |
Symbol | hypE |
ID | 6871347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 3046866 |
End bp | 3047876 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642786189 |
Product | hydrogenase expression/formation protein HypE |
Protein accession | YP_002216830 |
Protein GI | 198242620 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0309] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR02124] hydrogenase expression/formation protein HypE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 0.707243 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAACAATA TACAACTCGC CCACGGCAGC GGCGGCCAGG CTATGCAACA GTTGATTAAT AGCCTGTTTA TGGAGGCCTT TGCCAACCCG TGGCTGGCGG AACAAGAAGA TCAGGCGCGT CTGGAGCTGG CGCAACTGAC GGCGGAGGGC GACCGTCTGG CGTTTTCTAC CGATAGCTAT GTGATCGATC CGCTCTTTTT CCCCGGCGGC AATATCGGCA AACTGGCGAT TTGCGGTACG GCGAACGACG TTGCGGTGAG CGGCGCGATC CCCCGCTATC TCTCCTGCGG CTTTATCCTT GAAGAAGGGC TGCCGATGGA GACGTTAAAA AGCGTGGTCA ATAGCATGGC GGCGACCGCG CGGGAAGCGG ATATCGCCAT TGTAACCGGC GACACCAAAG TTGTGCAGCG CGGCGCGGCG GATAAATTAT TTATCAACAC TGCCGGCATG GGGGCGATTC CCGCCGATAT TCGCTGGGGC GCGCAAACGC TGAGCGTTGG CGATGTGCTG TTAGTCAGCG GTACGCTTGG CGATCACGGT GCCACTATCC TCAACCTGCG TGAGCAACTG GGACTGGATG GCGAGCTGGC GAGCGACTGC GCAGTATTAA CGCCGCTTAT TCAGACGCTG CGTCATATTG ACGGAGTGAA GGCATTGCGT GACGCCACGC GCGGCGGCGT GAATGCGGTC GCCCATGAGT TTGCAACGTC CTGCGGTTAC GGTATTGAAT TGTCCGAATC CGCACTGCCG CTCAAACCTG CCGTGCGCGG CGTCTGCGAG CTGTTGGGGC TGGATGCCCT GAACTTTGCC AACGAAGGAA AACTGGTGAT TGCCGTTGAA CGGCAGGCCG CAGATCGGGC GTTGGCGGCA TTACGCGCGC ATCCGCTGGG ACGTGATGCA GCGCTGATTG GCGAAGTCGT GGAACGCAAA GGCGTTCGCT TAGCCGGACT CTATGGCGTG AAGCGAACCC TTGATTTGCC ACACGCCGAA CCATTACCTC GTATATGCTA G
|
Protein sequence | MNNIQLAHGS GGQAMQQLIN SLFMEAFANP WLAEQEDQAR LELAQLTAEG DRLAFSTDSY VIDPLFFPGG NIGKLAICGT ANDVAVSGAI PRYLSCGFIL EEGLPMETLK SVVNSMAATA READIAIVTG DTKVVQRGAA DKLFINTAGM GAIPADIRWG AQTLSVGDVL LVSGTLGDHG ATILNLREQL GLDGELASDC AVLTPLIQTL RHIDGVKALR DATRGGVNAV AHEFATSCGY GIELSESALP LKPAVRGVCE LLGLDALNFA NEGKLVIAVE RQAADRALAA LRAHPLGRDA ALIGEVVERK GVRLAGLYGV KRTLDLPHAE PLPRIC
|
| |