Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4319 |
Symbol | nrfE |
ID | 5595073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4323278 |
End bp | 4324936 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640923417 |
Product | heme lyase subunit NrfE |
Protein accession | YP_001460862 |
Protein GI | 157163544 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1138] Cytochrome c biogenesis factor |
TIGRFAM ID | [TIGR00353] c-type cytochrome biogenesis protein CcmF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 71 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTGACCC CGTTGACGGC CTTTGCGGGA GTGCGGTTGC GCTGGCCTGC CATGATGCGA CTCACTTGCA TCGGCATTCT GGCGCAGTTC GCGCTCCTGC TGCTCGCCTT TGGCGTACTG ACGTATTGTT TTCTCATCAG CGATTTCTCG GTCATTTATG TCGCCCAACA TAGCTACAGC CTGCTGTCGT GGGAACTCAA ACTGGCGGCG GTGTGGGGCG GCCATGAAGG TTCGCTGCTG CTTTGGGTGC TGCTGCTTTC CGCCTGGAGC GCGCTGTTTG CCTGGCATTA TCGGCAGCAA ACCGATCCGC TATTTCCGCT GACGCTAGCC GTTTTATCTC TCATGCTCGC CGCACTGCTA CTGTTTGTAG TGCTGTGGTC CGATCCCTTC GTGCGGATAT TTCCACCAGC AATCGAAGGC CGCGATCTCA ATCCGATGCT GCAACATCCC GGTCTTATCT TTCATCCACC GCTGCTTTAC CTTGGCTATG GCGGTTTGAT GGTAGCGGCG AGCGTGGCGC TGGCGAGTTT ACTGCGCGGC GAGTTTGATG GTGCCTGCGC CAGAATTTGT TGGCGCTGGG CACTACCTGG CTGGAGCGCA TTAACGGCGG GGATCATCCT CGGTTCTTGG TGGGCCTATT GCGAACTGGG CTGGGGCGGC TGGTGGTTCT GGGATCCGGT GGAAAACGCC TCTTTATTAC CCTGGCTTTC TGCCACTGCG CTGCTGCACA GTTTGTCCCT GACACGCCAG CGGGAGATTT TCCGCCACTG GTCGCTGTTG CTGGCGATAG TAACTCTGAT GCTGTCGCTA CTGGGTACCT TAATTGTCCG TTCTGGCATT CTGGTTTCGG TTCATGCGTT CGCACTGGAT AACGTCCGCG CCGTGCCGTT GTTCAGCCTG TTTGCACTGA TTAGCCTTGC GTCTCTGGCT CTGTATGGCT GGCGAGCGCG GGACGGTGGT GCGGTGGTGC GTTTTTCGGG GTTATCGCGG GAAATGTTAA TCCTCGCTAC GCTGTTGCTG TTTTGCGCAG TGCTACTGAT CGTGCTGGTG GGAACGCTTT ATCCGATGAT TTACGGCCTG CTGGGCTGGG GACGCCTCTC CGTTGGCGCG CCGTATTTTA ACCGCGCGAC GTTACCGTTT GGCCTGTTGA TGCTGGTGGT GATTGTGCTG GCGACGTTTG TCTCTGGCAA ACGCGTGCAG CTTCCGGCGC TGGTAGCTCA TGCGGGCGTG CTGTTATTTG CCGCGGGGAT CGTGGTTTCC AGCGTCAGCC GTCAGGAGAT CAGCCTGAAT TTACAGCCGG GTCAGCAGGT GACGCTGGCA GGATACACCT TCCGTTTTGA GCGCCTCGAT CTGCAAGCCA AAGGCAATTA CACCAGCGAA AAAGCGATAG TGGCACTGTT TGACCATCAG CAACGCATTG GTGAACTGAT GCCGGAGCGG CGTTTTTACG AAGCACGTCG TCAGCAAATG ATGGAACCGT CAATTCGCTG GAACGGCATC CATGACTGGT ATGCGGTCAT GGGTGAAAAA ACCGGAGCGG ATCGTTACGC TTTTCGCTTG TATGTACAAA GCGGTGTGCG CTGGATCTGG GGGGGAGGAT TGTTGATGAT TGCGGGCGCA TTGTTAAGCG GATGGCGGGG GAGGAAGCGC GATGAATAA
|
Protein sequence | MLTPLTAFAG VRLRWPAMMR LTCIGILAQF ALLLLAFGVL TYCFLISDFS VIYVAQHSYS LLSWELKLAA VWGGHEGSLL LWVLLLSAWS ALFAWHYRQQ TDPLFPLTLA VLSLMLAALL LFVVLWSDPF VRIFPPAIEG RDLNPMLQHP GLIFHPPLLY LGYGGLMVAA SVALASLLRG EFDGACARIC WRWALPGWSA LTAGIILGSW WAYCELGWGG WWFWDPVENA SLLPWLSATA LLHSLSLTRQ REIFRHWSLL LAIVTLMLSL LGTLIVRSGI LVSVHAFALD NVRAVPLFSL FALISLASLA LYGWRARDGG AVVRFSGLSR EMLILATLLL FCAVLLIVLV GTLYPMIYGL LGWGRLSVGA PYFNRATLPF GLLMLVVIVL ATFVSGKRVQ LPALVAHAGV LLFAAGIVVS SVSRQEISLN LQPGQQVTLA GYTFRFERLD LQAKGNYTSE KAIVALFDHQ QRIGELMPER RFYEARRQQM MEPSIRWNGI HDWYAVMGEK TGADRYAFRL YVQSGVRWIW GGGLLMIAGA LLSGWRGRKR DE
|
| |