Gene Information |
Locus tag | EcSMS35_4536 |
Symbol | nrfE |
ID | 6146496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4637546 |
End bp | 4639228 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641619352 |
Product | heme lyase subunit NrfE |
Protein accession | YP_001746464 |
Protein GI | 170682982 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1138] Cytochrome c biogenesis factor |
TIGRFAM ID | [TIGR00353] c-type cytochrome biogenesis protein CcmF |
|
|
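The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length(4637546, 4639228)
print(length)                  # 1683
print(protein_length(length))  # 560
```

Under these assumptions the computed values match the reported Gene Length (1683 bp) and Protein Length (560 aa).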
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTGTTAA GTCTCGGGGT CAACGTGTTG ACCCCGTTGA CGGCCTTTGC GGGAGTGCGG TTGCGCTGGC CTGCCATGAT GCGACTCACT TGCATCGGCA TTCTGGCGCA GTTCGCGCTC CTGCTGCTCG CCTTTGGCGT ACTGACGTAT TGTTTTCTCA TCAGCGATTT CTCGGTCATT TATGTCGCCC AACATAGCTA CAGCCTGCTG TCGTGGGAAC TCAAACTGGC GGCGGTGTGG GGCGGTCATG AAGGTTCGCT GCTGCTTTGG GTGCTGCTGC TTTCCGCCTG GAGCGCGCTG TTTGCCTGGC ATTATCGGCA GCAAACCGAT CCGCTATTTC CGCTGACGCT AGCCGTTTTA TCTCTCATGC TCGCCGCACT GCTACTGTTT GTAGTGCTGT GGTCCGATCC CTTCGTGCGG ATATTTCCGC CAGCAATCGA AGGCCGCGAT CTCAATCCGA TGCTGCAACA TCCCGGTCTT ATCTTTCATC CACCGCTGCT TTATCTCGGC TACGGCGGTT TGATGGTGGC GGCGAGCGTG GCGCTGGCGA GTTTACTGCG CGGCGAGTTT GATGGTACCT GCGCCCGAAT TTGCTGGCGC TGGGCACTAC CAGGCTGGGG GGCATTAACG GCGGGGATCA TCCTCGGTTC CTGGTGGGCC TATTGCGAAC TGGGCTGGGG CGGCTGGTGG TTCTGGGATC CGGTGGAAAA CGCCTCTTTA TTACCCTGGC TTTCTGCCAC TGCGCTGCTG CACAGTTTGT CCCTGACACG CCAGCGGGGG ATTTTCCGCC ACTGGTCGCT GTTACTGGCG ATAGTTACTC TGATGCTGTC GCTGCTGGGC ACCTTAATTG TCCGTTCTGG CATTCTGGTT TCGGTTCATG CGTTCGCGCT GGATAACGTT CGCGCCGTGC CGTTGTTCAG CCTGTTTGCA CTGATTAGCC TTGCGTCTCT GACTCTGTAT GGCTGGCGAG CGCGGGACGG TGGCCCGGCG GTGCGTTTTT CGGGGTTATC GCGGGAAATG TTAATCCTCG CTACGCTGTT GCTGTTTTGC GCAGTGCTAC TGATCGTGCT GGTGGGAACG CTTTATCCGA TGATTTACGG CCTGCTGGGC TGGGGACGCC TCTCCGTTGG CGCGCCGTAT TTTAACCGTG CGACGTTACC GTTTGGTCTG TTGATGCTGG TGGTGATTGT GCTGGCGACG TTTGTCTCTG GCAAACGCGT GCAGCTTCCG GCGCTGCTGG CGCATGCGGG CGTGCTGTTA TTTGCCGCAG GGATCGTGGT CTCCAGCGTC AGCCGTCAGG AGATCAGCCT GAATTTACAG CCGGGTCAGC AGGTGACGCT GGCAGGATAC GCCTTCCGTT TTGAGCGTCT CGATCTGCAA GCCAGAGGCA ATTACACCAG CGAAAAAGCG ATAGTGGCAC TGTTTGACCA TCAGCAACGT ATTGGTGAAC TGACGCCGGA GCGGCGTTTT TATGAAGCAC GCCGTCAGCA AATGATGGAA CCGTCAATTC GCTGGAACGG CATCCATGAC TGGTATGCGG TCATGGGGGA GAAAACTGGG CCGGATCGTT ACGCTTTTCG TTTGTATGTA CAAAGCGGTG TGCGCTGGAT CTGGGGGGGA GGATTGTTGA TGATTGCGGG CGCATTGTTA AGCGGATGGC GGGGGAGGAA GCGCGATGAA TAA
|
Protein sequence | MLLSLGVNVL TPLTAFAGVR LRWPAMMRLT CIGILAQFAL LLLAFGVLTY CFLISDFSVI YVAQHSYSLL SWELKLAAVW GGHEGSLLLW VLLLSAWSAL FAWHYRQQTD PLFPLTLAVL SLMLAALLLF VVLWSDPFVR IFPPAIEGRD LNPMLQHPGL IFHPPLLYLG YGGLMVAASV ALASLLRGEF DGTCARICWR WALPGWGALT AGIILGSWWA YCELGWGGWW FWDPVENASL LPWLSATALL HSLSLTRQRG IFRHWSLLLA IVTLMLSLLG TLIVRSGILV SVHAFALDNV RAVPLFSLFA LISLASLTLY GWRARDGGPA VRFSGLSREM LILATLLLFC AVLLIVLVGT LYPMIYGLLG WGRLSVGAPY FNRATLPFGL LMLVVIVLAT FVSGKRVQLP ALLAHAGVLL FAAGIVVSSV SRQEISLNLQ PGQQVTLAGY AFRFERLDLQ ARGNYTSEKA IVALFDHQQR IGELTPERRF YEARRQQMME PSIRWNGIHD WYAVMGEKTG PDRYAFRLYV QSGVRWIWGG GLLMIAGALL SGWRGRKRDE
|
| |
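The relationship between the gene sequence, the protein sequence, and the Translation table and GC content fields can be illustrated with a short sketch. It assumes the standard amino acid assignments shared by NCBI translation tables 1 and 11, and assumes the table 11 initiator codons listed in `START_CODONS`, each of which is read as methionine when it is the first codon. The names `translate` and `gc_content` are hypothetical helpers, and only the first ten codons of the gene are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acid string in NCBI codon order (first base varies slowest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiator codons assumed for translation table 11 (bacterial).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spacing used in the record's 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS; an initiator codon in first position becomes M."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
prefix = "TTGTTGTTAA GTCTCGGGGT CAACGTGTTG"
print(translate(prefix))  # MLLSLGVNVL
```

The gene begins with TTG rather than ATG, which is why the first residue of the protein sequence is M even though TTG normally encodes leucine. Passing the full gene sequence to `gc_content` would give a value to compare against the reported GC content field.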