Gene Information |
Locus tag | ECH74115_3962 |
Symbol | hypF |
ID | 6970084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3663337 |
End bp | 3665637 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643387731 |
Product | carbamoyltransferase HypF |
Protein accession | YP_002272174 |
Protein GI | 209397726 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0068] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR00143] [NiFe] hydrogenase maturation protein HypF |
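The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # both endpoints are counted
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon is not translated
    return gene_len, protein_len


# Start bp and End bp values taken from the record above.
print(cds_lengths(3663337, 3665637))  # (2301, 766)
```

The result matches the Gene Length (2301 bp) and Protein Length (766 aa) fields.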
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGTCCGGCT GGACGGGTTA CGCAATTCAT GATGGCGGGA ATAACTCAAT GGCAAAAAAC ACATCTTGCG GTGTCCAACT GCGTATTCGA GGCAAAGTGC AGGGCGTCGG TTTTCGTCCG TTTGTCTGGC AGCTGGCACA GCAATTAAAT CTTCACGGCG ATGTCTGTAA TGACGGCGAT GGCGTAGAAG TCCGGCTGCT GGAAGACCCG GAAACGTTTC TTGTTCAATT GCATCAGCAC TGCCCGCCGC TGGCGCGTAT TGATAGCGTT GAGCGTGAAC CGTTTATCTG GTCACAACTG CCCACTGAGT TCACCATCCG CCAGAGCGCG GGCGGCGCCA TGAATACGCA AATTGTTCCC GATGCCGCCA CTTGCCCTGC TTGCCTTGCC GAAATGAATA CCCCAGGCGA ACGGCGTTAT CGTTATCCAT TTATCAACTG TACCCACTGC GGCCCGCGCT TCACCATTAT TCGCGCCATG CCTTACGATC GCCCGTTTAC CGTGATGGCG GCGTTTCCGC TGTGTCCAGC CTGTGATAAA GAGTACCGCG ACCCGCTCGA TCGTCGCTTC CACGCCCAGC CGGTGGCCTG CCCGGAGTGT GGCCCGTATC TTGAATGGGT AAGTCATGGT GAACATGCAG AACAAGAGGC GGCATTACAG GCGGCTATCG CACAGTTAAA AATGGGCAAC ATTGTCGCCA TCAAAGGGAT TGGCGGATTT CATCTTGCCT GCGATGCACG TAACAGTAAC GCGGTGGCGA CACTTCGGGC GCGCAAACAT CGCCCGGCGA AACCGCTGGC AGTCATGTTG CCAGTGGCTG ACGGTTTACC AGACGCTGCG CGCCAGTTGC TTACCACGCC CGCCGCGCCG ATTGTGCTGG TGGATAAAAA ATACGTTCCT GAGCTTTGTG ATGATATCGC CCCTGGCCTT AACGAAGTCG GGGTAATGTT GCCTGCGAAT CCGCTTCAGC ATTTGCTGTT ACAGGAACTA CAATGCCCGC TGGTGATGAC CTCTGGCAAC CTGAGCGGTA AACCACCGGC TATCAGCAAC GAACAGGCGC TGGAGGATTT GCAGGGCATT GCCGACGGAT TCTTAATACA CAACCGCGAC ATTGTGCAGC GGATGGATGA TTCGGTGGTG CGCGAAAGCG GCGAAATGCT GCGCCGTTCG CGGGGGTATG TGCCGGATGC GCTGGCTTTG CCTCCGGGCT TTAAAAATGT TCCGCCTGTA CTGTGTCTCG GCGCGGATCT GAAAAATACC TTCTGCCTGG TGCGCGGTGA ACAAGTGGTG TTGAGCCAGC ATCTGGGCGA TTTAAGTGAC GATGGCATCC AGACGCAGTG GCGCGAAGCG TTACGCCTGA TGCAAAACAT CTACAATTTC ACTCCGCAAT ACGTTGTGCA TGACGCACAT CCGGGCTATG TCTCCTGCCA GTGGGCGAGC GAAATGAATC TGCCGACGCA AACGGTGCTG CATCATCATG CCCACGCAGC GGCGTGTCTG GCAGAGCATC AGTGGCCGCT GGATGGCGGT GATGTCATTG CTTTGACGCT CGACGGTATC GGTATGGGGG AGAACGGCGC TTTGTGGGGC GGCGAGTGCC TGCGGGTGAA CTATCGCGAA TGTGAGCACC TGGGCGGCTT GCCTGCAGTG GCGCTTCCGG GTGGCGATTT GGCGGCGAAG CAGCCGTGGC GTAACCTGCT GGCGCAGTGC CTGCGCTTTG TGCCGGAGTG GCAGAATTAC CCTGAAACGG CAAGTGTGCA ACAGCAAAAC TGGAGCGTAC TGGCGCGGGC CATTGAGCGT 
GGAATTAACG CGCCGCTGGC GTCATCGTGT GGGCGTTTGT TCGATGCTGT GGCGGCGGCA CTGGGCTGTG CGCCAGCCAC GTTAAGTTAT GAAGGTGAAG CGGCTTGTGC TCTGGAGGCG CTAGCAGCCT CATGCGACGG AGTGACGCAT CCGGTGACGA TGCCGCGGGT GGACAATCAA CTGGATCTCG CCACTTTCTG GCAGCAGTGG CTGAACTGGC AGGCACCGGT TAATCAACGC GCGTGGGCGT TTCATGATGC GCTGGCGCAG GGTTTTGCCG CGTTGATGCG TGAGCAGGCC ACGATGCGTG GTATCACTAC GCTGGTATTT AGCGGCGGGG TTATTCATAA CCGTTTGCTG CGTGCACGTC TGGCGCATTA TCTCGCTGAT TTCACATTGC TGTTTCCACA GAGTTTACCG GCGGGTGATG GCGGTTTGTC TCTGGGGCAG GGGGTTATTG CTGCGGCGCG TTGGTTAGCG GGTGAAGTCC AGAACGGATA A
Protein sequence | MSGWTGYAIH DGGNNSMAKN TSCGVQLRIR GKVQGVGFRP FVWQLAQQLN LHGDVCNDGD GVEVRLLEDP ETFLVQLHQH CPPLARIDSV EREPFIWSQL PTEFTIRQSA GGAMNTQIVP DAATCPACLA EMNTPGERRY RYPFINCTHC GPRFTIIRAM PYDRPFTVMA AFPLCPACDK EYRDPLDRRF HAQPVACPEC GPYLEWVSHG EHAEQEAALQ AAIAQLKMGN IVAIKGIGGF HLACDARNSN AVATLRARKH RPAKPLAVML PVADGLPDAA RQLLTTPAAP IVLVDKKYVP ELCDDIAPGL NEVGVMLPAN PLQHLLLQEL QCPLVMTSGN LSGKPPAISN EQALEDLQGI ADGFLIHNRD IVQRMDDSVV RESGEMLRRS RGYVPDALAL PPGFKNVPPV LCLGADLKNT FCLVRGEQVV LSQHLGDLSD DGIQTQWREA LRLMQNIYNF TPQYVVHDAH PGYVSCQWAS EMNLPTQTVL HHHAHAAACL AEHQWPLDGG DVIALTLDGI GMGENGALWG GECLRVNYRE CEHLGGLPAV ALPGGDLAAK QPWRNLLAQC LRFVPEWQNY PETASVQQQN WSVLARAIER GINAPLASSC GRLFDAVAAA LGCAPATLSY EGEAACALEA LAASCDGVTH PVTMPRVDNQ LDLATFWQQW LNWQAPVNQR AWAFHDALAQ GFAALMREQA TMRGITTLVF SGGVIHNRLL RARLAHYLAD FTLLFPQSLP AGDGGLSLGQ GVIAAARWLA GEVQNG
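The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. The following is a minimal sketch of that rule, not the pipeline that produced this record: the helper `translate_cds` is hypothetical, and the codon table is built from the standard NCBI amino acid string, whose codon assignments table 11 shares with the standard code.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for NCBI translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence (hypothetical helper, table 11 rules).

    Whitespace is ignored, so the space-separated blocks of 10 bases used
    in this record can be passed in directly. Translation stops at the
    first stop codon; a trailing incomplete codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE_11:
            protein.append("M")  # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGTCCGGCT GGACGGGTTA CGCAATTCAT"))  # MSGWTGYAIH
```

The output agrees with the first ten residues of the protein sequence above. The sketch does not handle ambiguous bases such as N.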