Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03906 |
Symbol | nrfE |
ID | 8113010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 4197014 |
End bp | 4198672 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644850059 |
Product | hypothetical protein |
Protein accession | YP_003001632 |
Protein GI | 251787328 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1138] Cytochrome c biogenesis factor |
TIGRFAM ID | [TIGR00353] c-type cytochrome biogenesis protein CcmF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTGACCC CGTTGACGGC CTTTGCGGGA GTGCGGTTGC GCTGGCCTGC CATGATGCGA CTCACTTGCA TCGGCATTCT GGCGCAGTTC GCGCTCCTGC TGCTCGCCTT TGGCGTACTG ACGTATTGTT TTCTCATCAG CGATTTCTCG GTCATTTATG TCGCGCAACA TAGCTACAGC CTGCTGTCGT GGGAACTCAA GCTGGCAGCT GTGTGGGGCG GCCATGAAGG TTCGCTGCTG CTTTGGGTGC TGCTGCTTTC CGCCTGGAGC GCGCTGTTTG CCTGGCATTA TCGGCAGCAA ACCGATCCGC TATTTCCGCT GACGCTAGCC GTTTTATCTC TCATGCTCGC CGCACTGCTA CTGTTTGTGG TGCTGTGGTC CGATCCCTTC GTGCGGATAT TTCCACCAGC AATCGAAGGC CGCGATCTCA ATCCGATGCT GCAACATCCC GGTCTTATCT TTCATCCACC GCTGCTTTAT CTCGGGTATG GCGGTTTGAT GGTGGCGGCG AGCGTGGCGC TGGCGAGTTT ACTGCGCAGC GAGTTTGATG GTGCCTGCGC CCGAATTTGC TGGCGCTGGG CACTACCTGG CTGGAGCGCA TTAACGGCGG GGATCATTCT CGGTTCCTGG TGGGCCTATT GCGAACTGGG CTGGGGCGGC TGGTGGTTCT GGGATCCGGT GGAAAACGCC TCTTTATTAC CCTGGCTTTC TGCCACTGCG CTGCTGCACA GTTTATCTCT GACACGTCAG CGGGGGATTT TCCGCCACTG GTCGCTGCTG CTGGCGATAG TTACTCTGAT GCTGTCGCTG CTGGGCACCT TAATTGTCCG TTCTGGCATT CTGGTTTCGG TTCATGCGTT CGCGCTGGAT AACGTCCGCG CCGTGCCGTT GTTCAGCCTG TTTGCACTGA TTAGCCTTGC GTCTCTGGCT CTGTATGGCT GGCGAGCGCG GGACGGTGGC CCGGCGGTGC GTTTTTCGGG GTTATCGCGG GAAATGTTAA TCCTCGCTAC GCTGTTGCTG TTTTGCGCAG TGCTACTGAT CGTGCTGGTG GGAACGCTTT ATCCGATGAT TTACGGCCTG CTGGGCTGGG GACGCCTCTC CGTTGGCGCA CCTTATTTTA ACCGCGCGAC GTTACCGTTT GGTCTGTTGA TGCTGGTGGT GATTGTGCTG GCGACGTTTG TCTCTGGCAA ACGCGCGCAG CTTCCGGCGC TGGTAGCTCA TGCGGGCGTG CTGTTATTTG CCGCTGGGGT CGTGGTTTCC AGCGTCAGCC GTCAGGAAAT CAGCCTGAAT TTACAGCCGG GTCAGCCGGT GACGCTGGCA GGATACACCT TCCGTTTTGA GCGCCTCGAT CTGCAAGCCA AAGGCAATTA CACCAGCGAA AAAGCGATAG TGGCGCTGTT TGACCATCAG CAACGCATTG GTGAATTAAC GCCGGAGCGG CGTTTTTACG AAGCTCGTCG TCAGCAAATG ATGGAACCGT CAATTCGCTG GAACGGCATC CATGACTGGT ATGCGGTCAT GGGGGAGAAA ACTGGGCTGG ATCGTTACGC TTTTCGTTTG TATGTACAAA GCGGTGTGCG CTGGATCTGG GGGGGAGGAT TGTTGATGAT TGCAGGCGCA TTATTAAGCG GATGGCGGGG GAGGAAGCGC GATGAGTAA
|
Protein sequence | MLTPLTAFAG VRLRWPAMMR LTCIGILAQF ALLLLAFGVL TYCFLISDFS VIYVAQHSYS LLSWELKLAA VWGGHEGSLL LWVLLLSAWS ALFAWHYRQQ TDPLFPLTLA VLSLMLAALL LFVVLWSDPF VRIFPPAIEG RDLNPMLQHP GLIFHPPLLY LGYGGLMVAA SVALASLLRS EFDGACARIC WRWALPGWSA LTAGIILGSW WAYCELGWGG WWFWDPVENA SLLPWLSATA LLHSLSLTRQ RGIFRHWSLL LAIVTLMLSL LGTLIVRSGI LVSVHAFALD NVRAVPLFSL FALISLASLA LYGWRARDGG PAVRFSGLSR EMLILATLLL FCAVLLIVLV GTLYPMIYGL LGWGRLSVGA PYFNRATLPF GLLMLVVIVL ATFVSGKRAQ LPALVAHAGV LLFAAGVVVS SVSRQEISLN LQPGQPVTLA GYTFRFERLD LQAKGNYTSE KAIVALFDHQ QRIGELTPER RFYEARRQQM MEPSIRWNGI HDWYAVMGEK TGLDRYAFRL YVQSGVRWIW GGGLLMIAGA LLSGWRGRKR DE
|
| |