Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1000 |
Symbol | |
ID | 6067708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1086637 |
End bp | 1088889 |
Gene Length | 2253 bp |
Protein Length | 750 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641600408 |
Product | (NiFe) hydrogenase maturation protein HypF |
Protein accession | YP_001723996 |
Protein GI | 170019042 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0068] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR00143] [NiFe] hydrogenase maturation protein HypF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000948603 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAAAAA ACACATCTTG CGGTGTCCAA CTGCGTATTC GTGGCAAAGT GCAGGGCGTC GGTTTTCGTC CGTTTGTCTG GCAGCTGGCA CAGCAATTAA ATCTTCACGG CGATGTCTGT AATGACGGCG ATGGCGTAGA AGTCCGGCTG CGGGAAGACC CGGAAACGTT TCTTGTTCAA TTGTATCAGC ACTGCCCGCC GCTGGCGCGT ATTGATAGCG TCGAGCGTGA GCCGTTTATC TGGTCACAAC TGCCCACCGA GTTCACTATA CGCCAGAGCA CAGGCGGCAC CATGAATACG CAAATTGTTC CCGATGCCGC TACTTGCCCT GCTTGCCTTG CCGAAATGAA TACCCCAGGC GAACGGCGTT ATCGTTATCC GTTTATCAAC TGTACCCACT GCGGCCCGCG TTTCACCATT ATTCGCGCCA TGCCTTACGA CCGCCCGTTT ACCGTGATGG CGGCGTTTCC GCTATGTCCG GCCTGTGACA AAGAGTACCG TGACCCGCTC GATCGTCGCT TCCACGCCCA GCCGGTGGCC TGCCCGGAGT GTGGCCCGCA TCTTGAATGG GTAAGTCATG GTGAACATGC GGAACAAGAG GCGGCATTAC AGGCAGCTAT CGCACAGTTA AAAATGGGCA AAATTGTCGC CATCAAAGGG ATTGGCGGAT TTCATCTTGC CTGCGATGCA CGTAACAGTA ACGCGGTGGC GACACTTCGG GCGCGCAAAC ATCGCCCGGC GAAACCGCTG GCGGTCATGT TGCCAGTGGC TGACGGTTTA CCAGACGCTG CGCGCCAGTT GCTTACCACG CCCGCCGCGC CGATTGTGCT GGTGGATAAA AAATACGTTC CTGAGCTTTG TGATGATATC GCCCCTGACC TTAACGAAGT CGGGGTAATG TTGCCTGCGA ACCCGCTCCA GCATTTGCTG TTACAGGAAC TGCAATGCCC GCTGGTGATG ACCTCCGGCA ACCTGAGCGG TAAACCACCA GCTATCAGCA ACAAACAGGC GCTGGCGGAT TTGCAGGGCA TTGCCGACGG ATTTTTGATA CATAACCGCG ACATCGTGCA GCGGATGGAT GATTCGGTGG TGCGCGAAAG CGGCGAAATG CTGCGCCGTT CGCGGGGGTA TGTGCCGGAT GCGCTGGCTT TGCCTCCGGG CTTTAAAAAT GTTCCGCCTG TGCTGTGTCT CGGCGCGGAT CTGAAAAATA CCTTCTGCCT GGTGCGCGGT GAACAAGCGG TGTTGAGTCA GCATCTGGGC GATTTAAGTG ACGATGGCAT CCAGATGCAG TGGCGCGAAG CGTTACGCCT GATGCAAAAC ATCTACGATT TCACCCCGCA ATACGTTGTG CATGACGCAC ATCCGGGCTA TGTCTCCAGC CAGTGGGCGC GCGAAATGAA TCTGCCGACG CAAACGGTGC TGCATCATCA TGCCCACGCA GCGGCGTGTC TGGCAGAGCA TCAGTGGCCG CTGGATGGCG GTGATGTCAT TGCTTTGACG CTCGACGGTA TCGGTATGGG GGAGAATGGC GCTTTGTGGG GCGGCGAGTG CCTGCGGGTG AACTATCGCG AATGTGAGCA CCTGGGCGGC TTGCCTGCAG TGGCGCTTGC TGGTGGCGAT TTGGCGGCGA AGCAGCCGTG GCGAAACCTG CTGGCGCAGT GCCTGCGTTT TGTGCCGGAG TGGCAGAATT ATCCCGAAAC AGCGAGTGTG CAACAGCAAA ACTGGAGCGT GCTGGCGCGG GCCATTGAGC GTGGAATTAA CGCGCCGCTG GCGTCATCGT GTGGGCGTTT TTTCGATGCA GTGGCGGCGG CACTGGGCTG TGCGCCAGCC ACGTTAAGTT ATGAAGGTGA AGCGGCTTGT GCTCTGGAGG CGCTCGCAGC CTCATGCCAC GGAGTGACGC ATCCGGTGAC AATGCCGCGG GTGGACAATC AACTGGATCT CGCCACTTTC TGGCAGCAGT GGCTGAACTG GCAGGCACCG GTTAATCAAC GCGCGTGGGC GTTTCATGAT GCGCTGGCGC AGGGTTTTGC CGCGTTGATG CGTGAGCAGG CCACGATGCG TGGTATCACT ACGCTGGTAT TTAGCGGCGG GGTTATTCAT AACCGTTTGC TGCGTGCACG TCTGGCGCAT TATCTCGCTG ATTTCACATT GCTCTTTCCA CAGAGTTTAC CGGCGGGTGA TGGCGGTTTG TCTCTGGGGC AGGGGGTTAT TGCTGCGGCG CGTTGGTTAG CGGGTGAAGT CCAGAACGGA TAA
|
Protein sequence | MAKNTSCGVQ LRIRGKVQGV GFRPFVWQLA QQLNLHGDVC NDGDGVEVRL REDPETFLVQ LYQHCPPLAR IDSVEREPFI WSQLPTEFTI RQSTGGTMNT QIVPDAATCP ACLAEMNTPG ERRYRYPFIN CTHCGPRFTI IRAMPYDRPF TVMAAFPLCP ACDKEYRDPL DRRFHAQPVA CPECGPHLEW VSHGEHAEQE AALQAAIAQL KMGKIVAIKG IGGFHLACDA RNSNAVATLR ARKHRPAKPL AVMLPVADGL PDAARQLLTT PAAPIVLVDK KYVPELCDDI APDLNEVGVM LPANPLQHLL LQELQCPLVM TSGNLSGKPP AISNKQALAD LQGIADGFLI HNRDIVQRMD DSVVRESGEM LRRSRGYVPD ALALPPGFKN VPPVLCLGAD LKNTFCLVRG EQAVLSQHLG DLSDDGIQMQ WREALRLMQN IYDFTPQYVV HDAHPGYVSS QWAREMNLPT QTVLHHHAHA AACLAEHQWP LDGGDVIALT LDGIGMGENG ALWGGECLRV NYRECEHLGG LPAVALAGGD LAAKQPWRNL LAQCLRFVPE WQNYPETASV QQQNWSVLAR AIERGINAPL ASSCGRFFDA VAAALGCAPA TLSYEGEAAC ALEALAASCH GVTHPVTMPR VDNQLDLATF WQQWLNWQAP VNQRAWAFHD ALAQGFAALM REQATMRGIT TLVFSGGVIH NRLLRARLAH YLADFTLLFP QSLPAGDGGL SLGQGVIAAA RWLAGEVQNG
|
| |