Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2835 |
Symbol | hypF |
ID | 6145832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2910351 |
End bp | 2912639 |
Gene Length | 2289 bp |
Protein Length | 762 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641617704 |
Product | [NiFe] hydrogenase maturation protein HypF |
Protein accession | YP_001744859 |
Protein GI | 170680945 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0068] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR00143] [NiFe] hydrogenase maturation protein HypF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0257983 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGGCA GGGCGGGTTA CGAAATTCAT GATGGCGGGA ATAACTCAAT GGCAAAAAAC ATATCTTGCG GTGTCCAACT GCGTATTCGT GGCAAAGTGC AGGGCGTCGG TTTTCGTCCG TTTGTCTGGC AACTGGCACA GCAATTAAAT CTTCACGGCG ATGTCTGTAA TGACGGCGAT GGCGTAGAAG TCCGGCTGCT GGAAGCCCCG GAAACGTTTC TTGTTCAATT GCATCAGCAC TGCCCGCCGC TGGCGCGTAT TGATAGCGTC GAGCGTGAGC CGTTTATCTG GTCACAACTG CCCACTGAGT TCACTATCCG CCAGAGCACA GGCGGCACCA TGAATACGCA AATTGTCCCG GATGCCGCCA CTTGCCCTGC TTGCCTTGCC GAAATGAATA CCCCGGGCGA GCGGCGTTAT CGTTATCCGT TTATCAACTG TACCCACTGC GGCCCGCGTT TCACCATTAT TCGCGCGATG CCTTACGACC GCCCGTTTAC CGTGATGGCG GCGTTTCCGC TGTGTCCGGC TTGTGACAAA GAGTACCGCG ACCCGCTCGA TCGTCGCTTC CACGCCCAGC CGGTGGCCTG CCCGGAGTGT GGCCCACATC TTGAATGGGT AAGTCATGGT GAACATGCAG AACAAGAGGC GGCATTACAG GCGGCTATCG CACAGTTAAA AATGGGCAAC ATTGTCGCCA TCAAAGGGAT TGGCGGATTT CATCTCGCCT GCGATGCGCG TAACAGTAAC GCGGTGGCGA CACTACGGGC ACGTAAACAT CGCCCGGCGA AACCGCTGGC GGTCATGTTG CCAGTGGCTG ACGGTTTACC AGACGCTGCG CGCCAGTTGC TTTCCACGCC CGCCGCGCCG ATTGTGCTGG TGGATAAAAA ATACATACCT GAGCTTTGTG ATGATATCGC CCCTGGCCTT AACGAAGTTG GGGTGATGTT GCCTGCGAAC CCGCTCCAGC ATTTGCTGTT ACAGGAACTG CAATGCCCGC TGGTGATGAC CTCCGGCAAC CTGAGCGGTA AACCACCAGC TATCAGCAAC GAACAGGCGC TGGCGGATTT GCAGGGCATT GCCGACGGAT TCTTGATACA TAACCGCGAC ATCGTGCAGC GGATGGATGA TTCGGTGGTG CGCGAAAGCG GCGAAATGCT GCGCCGTTCG CGGGGGTATG TGCCGGATGC GCTGGCTTTG CCTCCGGGCT TTAAAAATGT TCCGCCTGTG CTGTGTCTCG GCGCGGATCT GAAAAATACC TTCTGCCTGG TGCGCGGTGA ACAAGCGGTG TTGAGTCAGC ATCTGGGCGA TTTAAGTGAC GATGGCATCC AGATGCAGTG GCGCGAAGCG TTACGCCTGA TGCAAAACAT CTACGATTTT ACCCCGCAAT ACGTTGTGCA TGACGCACAT CCGGGCTATG TCTCCAGCCA GTGGGCGCGC GAAATGAATC TGCCGACGCA AACGGTGCTG CATCATCATG CCCACGCGGC GGCGTGTCTG GCAGAACATC ACTGGCCACT GGAGGGCGGT GATGTCATTG CTTTGACGCT CGACGGTATC GGTATGGGGG AGAACGGCGC TTTGTGGGGC GGCGAGTGTC TGCGGGTGAA CTATCGCGAA TGCCAGCACC TGGGCGGCTT GCCAGCGGTG GCGCTTGCTG GGGGCGATTT GGCTGCAAAG CAGCCGTGGC GTAACCTGCT GGCGCAGTGC CTGCGCTTTG TGCCGGAGTG GCAGAATTAC CCTGAAACGG CAAGTGTGCA ACAGCAAAAC TGGAGCGTGC TGGCGCGGGC CATTGAGCGT GGAATTAACG CGCCGCTGGC GTCATCGTGT GGGCGTTTGT TTGATGCTGT GGCGGCGGCA CTGGGCTGTG CGCCAGCCAC GTTAAGTTAT GAAGGTGAAG CGGCTTGTGC TCTGGAGGCG CTCGCAGCTT CATGCGACGG AGTGACGCAT CCGGTGACGA TGCCGCTGGT GGACAATCAA CTGGATCTCG CCACTTTCTG GCAGCAGTGG CTGAACTGGC AGGCACCGGT TAATCAACGC GCGTGGGCGT TTCATGATGC GCTGGCGCAG GGTTTTGCCG CGTTGATGCG TGAGCAGGCC ACGATGCGCG GTATCACTAC GCTGGTATTT AGCGGTGGGG TTATTCATAA CCGATTGCTG CGTACACGCC TGGCGCATTA TCTCGCTGAT TTCACATTAC TCTTTCCGCA GAGTTTACCG GCGGGTGATG GCGGCTTGTC TCTGGGGCAG GGGGTTATTG CTGCGGCGCG CATCCTGCAT GCTGCATAG
|
Protein sequence | MSGRAGYEIH DGGNNSMAKN ISCGVQLRIR GKVQGVGFRP FVWQLAQQLN LHGDVCNDGD GVEVRLLEAP ETFLVQLHQH CPPLARIDSV EREPFIWSQL PTEFTIRQST GGTMNTQIVP DAATCPACLA EMNTPGERRY RYPFINCTHC GPRFTIIRAM PYDRPFTVMA AFPLCPACDK EYRDPLDRRF HAQPVACPEC GPHLEWVSHG EHAEQEAALQ AAIAQLKMGN IVAIKGIGGF HLACDARNSN AVATLRARKH RPAKPLAVML PVADGLPDAA RQLLSTPAAP IVLVDKKYIP ELCDDIAPGL NEVGVMLPAN PLQHLLLQEL QCPLVMTSGN LSGKPPAISN EQALADLQGI ADGFLIHNRD IVQRMDDSVV RESGEMLRRS RGYVPDALAL PPGFKNVPPV LCLGADLKNT FCLVRGEQAV LSQHLGDLSD DGIQMQWREA LRLMQNIYDF TPQYVVHDAH PGYVSSQWAR EMNLPTQTVL HHHAHAAACL AEHHWPLEGG DVIALTLDGI GMGENGALWG GECLRVNYRE CQHLGGLPAV ALAGGDLAAK QPWRNLLAQC LRFVPEWQNY PETASVQQQN WSVLARAIER GINAPLASSC GRLFDAVAAA LGCAPATLSY EGEAACALEA LAASCDGVTH PVTMPLVDNQ LDLATFWQQW LNWQAPVNQR AWAFHDALAQ GFAALMREQA TMRGITTLVF SGGVIHNRLL RTRLAHYLAD FTLLFPQSLP AGDGGLSLGQ GVIAAARILH AA
|
| |