Gene Information |
Locus tag | EcHS_A2848 |
Symbol | hypF |
ID | 5592312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2852591 |
End bp | 2854843 |
Gene Length | 2253 bp |
Protein Length | 750 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640921965 |
Product | carbamoyltransferase HypF |
Protein accession | YP_001459476 |
Protein GI | 157162158 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0068] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR00143] [NiFe] hydrogenase maturation protein HypF |
|
|
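The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration; it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2852591
end_bp = 2854843

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(f"gene length: {gene_length_bp} bp")
print(f"leftover bases after splitting into codons: {remainder}")
print(f"protein length: {protein_length_aa} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (2253 bp) and Protein Length (750 aa).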
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAA ACACATCTTG CGGTGTCCAA CTGCGTATTC GTGGCAAAGT GCAGGGCGTC GGTTTTCGTC CGTTTGTCTG GCAGCTGGCA CAGCAATTAA ATCTTCACGG CGATGTCTGT AATGACGGCG ATGGCGTAGA AGTCCGGCTG CTGGAAGACC CGGAAACGTT TCTTGTTCAA TTGCATCAGC ACTGTCCGCC ACTGGCGCGT ATTGATAGCG TCGAGCGTGA GCCGTTTATC TGGTCACAAC TGCCCACCGA GTTCACTATC CGCCAGAGCG CGGGCGGTGC CATGAATACG CAAATTGTCC CGGATGCTGC CACTTGCCAT GCTTGCCTTG CCGAAATGAA TACCCCAGGC GAACGGCGTT ATCGTTATCC GTTTATCAAC TGTACCCACT GCGGCCCGCG CTTCACCATT ATTCGCGCCA TGCCTTACGA CCGCCCGTTT ACCGTGATGG CGGCGTTTCC GCTGTGTCCG GCCTGTGATA AAGAGTACCG TGACCCGCTC GATCGTCGCT TCCACGCCCA GCCGGTGGCC TGCCCGGAGT GTGGCCCGCA TCTTGAATGG GTAAGTCATG GTGAACATGC AGAACAAGAG GCGGCATTAC AGGCAGCTAT CGCACAGTTA AAAATGGGCA ACATTGTCGC CATCAAAGGG ATTGGCGGAT TTCATCTTGC CTGCGATGCA CGTAACAGTA ACGCGGTGGC GACACTGCGG GCACGCAAAC ATCGCCCGGC GAAACCGCTG GCGGTTATGT TGCCAGTGGC AGAAGGTTTA CCAGACGCTG CGCGCCAGTT GCTTACCACG CCCGCCGCGC CGATTGTGCT GGTGGATAAA AAATACGTTC CTGAGCTTTG TGATGATATC GCCCCTGGCC TTAACGAAGT CGGGGTGATG TTGCCTGCGA ACCCGCTCCA GCATTTGCTG TTACAGGAAC TGCAATGCCC GCTGGTGATG ACCTCCGGCA ACCTGAGCGG TAAACCACCG GCTATCAGCA ACGAACAGGC GCTGGCGGAT TTGCAGGGCA TTGCCGACGG ATTCTTGATA CATAACCGCG ACATCGTGCA GCGGATGGAT GATTCGGTGG TGCGCGAAAG CGGCGAAATG CTGCGCCGTT CGCGGGGGTA TGTGCCGGAT GCGCTGGCTT TGCCTCTGGG CTTTAAAAAT GTTCCGCCTG TGCTGTGTCT CGGCGCGGAT CTGAAAAACA CCTTCTGCCT GGTGCGCGGT GAACAAGCGG TGTTGAGTCA GCATCTGGGC GATTTAAGTG ACGATGGCAT CCAGATGCAG TGGCGCGAAG CGTTACGCCT GATGCAAAAC ATCTACGATT TCACTCCGCA ATACGTTGTG CATGACGCGC ATCCGGGCTA TGTCTCCAGC CAGTGGGCGC GCGAAATGAA TCTGCCGACG CAAACGGTGC TGCATCATCA TGCCCACGCA GCGGCGTGTC TGGCAGAGCA TCAGTGGCCG CTGGATGGCG GTGATGTCAT TGCTTTGACG CTCGACGGTA TCGGTATGGG GGAGAACGGC GCTTTGTGGG GCGGCGAGTG CCTGCGGGTG AACTATCGCG AATGCCAGCA CCTGGGCGGC TTGCCTGCGG TGGCGCTTCC GGGTGGCGAT TTGGCAGCGA AGCAGCCGTG GCGAAACCTG CTGGCGCAGT GCCTGCGCTT TGTGCCGGAG TGGCAGAATT ACTCTGAAAC AGCAAGTGTG CAACAGCAAA ACTGGAGCGT GCTGGCGCGG GCCATTGAGC GTGGAATTAA CGCGCCGCTG GCGTCATCGT GTGGGCGTTT TTTCGATGCA 
GTGGCGGCGG CACTGGGCTG TGCGCCAGCC ACGTTAAGTT ATGAAGGTGA AGCGGCTTGT GCTCTGGAGG CGCTCGCAGC CTCATGCCAC GGAGTGACGC ATCCGGTGAC AATGCCGCGG GTGGACAATC AACTGGATCT CGCCACTTTC TGGCAGCAGT GGCTGAACTG GCAGGCACCG GTTAATCAAC GCGCGTGGGC GTTTCATGAT GCGCTGGCGC AGGGTTTTGC CGCGTTGATG CGTGAGCAGG CCACGATGCG TGGTATCACT ACGCTGGTAT TTAGCGGCGG GGTTATTCAT AACCGTTTGC TGCGTGCACG TCTGGCGCAT TATCTCGCTG ATTTCACATT GCTCTTTCCA CAGAGTTTAC CGGCGGGTGA TGGCGGTTTG TCTCTGGGGC AGGGGGTTAT TGCTGCGGCG CGTTGGTTAG CGGGTGAAGT CCAGAACGGA TAA
|
Protein sequence | MAKNTSCGVQ LRIRGKVQGV GFRPFVWQLA QQLNLHGDVC NDGDGVEVRL LEDPETFLVQ LHQHCPPLAR IDSVEREPFI WSQLPTEFTI RQSAGGAMNT QIVPDAATCH ACLAEMNTPG ERRYRYPFIN CTHCGPRFTI IRAMPYDRPF TVMAAFPLCP ACDKEYRDPL DRRFHAQPVA CPECGPHLEW VSHGEHAEQE AALQAAIAQL KMGNIVAIKG IGGFHLACDA RNSNAVATLR ARKHRPAKPL AVMLPVAEGL PDAARQLLTT PAAPIVLVDK KYVPELCDDI APGLNEVGVM LPANPLQHLL LQELQCPLVM TSGNLSGKPP AISNEQALAD LQGIADGFLI HNRDIVQRMD DSVVRESGEM LRRSRGYVPD ALALPLGFKN VPPVLCLGAD LKNTFCLVRG EQAVLSQHLG DLSDDGIQMQ WREALRLMQN IYDFTPQYVV HDAHPGYVSS QWAREMNLPT QTVLHHHAHA AACLAEHQWP LDGGDVIALT LDGIGMGENG ALWGGECLRV NYRECQHLGG LPAVALPGGD LAAKQPWRNL LAQCLRFVPE WQNYSETASV QQQNWSVLAR AIERGINAPL ASSCGRFFDA VAAALGCAPA TLSYEGEAAC ALEALAASCH GVTHPVTMPR VDNQLDLATF WQQWLNWQAP VNQRAWAFHD ALAQGFAALM REQATMRGIT TLVFSGGVIH NRLLRARLAH YLADFTLLFP QSLPAGDGGL SLGQGVIAAA RWLAGEVQNG
|
| |
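The sequences above are printed in space-separated groups of 10 characters, so they need to be cleaned before any calculation. The following is a minimal sketch of how the grouped gene sequence could be normalised and its GC fraction computed; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo uses only the first three groups of the gene sequence rather than the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence above, used only as a small demo.
fragment = clean_sequence("ATGGCAAAAA ACACATCTTG CGGTGTCCAA")

print(len(fragment), round(gc_fraction(fragment), 3))
```

Applying the same two helpers to the complete gene sequence gives a value that can be compared with the GC content field listed in the Gene Information table.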