Gene ECH74115_5579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5579 
SymbolnrfE 
ID6969070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5219841 
End bp5221520 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content57% 
IMG OID643389218 
Productheme lyase subunit NrfE 
Protein accessionYP_002273615 
Protein GI209397457 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1138] Cytochrome c biogenesis factor 
TIGRFAM ID[TIGR00353] c-type cytochrome biogenesis protein CcmF 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTAAGTC TCGGGGTCAA CGTGTTGACC CCGTTGACGG CCTTTGCGGG AGTGCGGTTG 
CGCTGGCCTG CCATGATGCG ACTCACTTGC ATCGGCATTC TGGCACAGTT CGCGCTCCTG
CTGCTCGCCT TTGGCGTACT GACGTATTGT TTTCTCATCA GCGATTTCTC GGTCATTTAT
GTCGCCCAAC ATAGCTACAG CTTGCTGTCG TGGAAACTCA AACTGGCGGC GGTGTGGGGC
GGTCATGAAG GTTCGCTGCT GCTTTGGGTG CTGCTGCTTT CCGCCTGGAG CGCGCTGTTT
GCCTGGCAAT ATCGGCAGCA AACCGATCCG CTATTTCCGC TGACGCTAGC CGTTTTATCT
CTCATGCTCG CCGCACTGCT ACTGTTTGTA GTGCTGTGGT CCGATCCCTT CGTGCGGATA
TTTCCACCAG CAATCGAAGG CCGCGATCTC AATCCGATGC TGCAACATCC CGGTCTTATC
TTTCATCCAC CGCTGCTTTA CCTTGGCTAT GGCGGTTTGA TGGTAGCGGC GAGCGTGGCG
CTGGCGAGTC TACTGCGCGG CGAGTTTGAT GGTGCCTGCG CCCGAATTTG CTGGCGCTGG
GCGTTACCTG GCTGGAGTGC ATTAACTGCG GGGATCATCC TCGGTTCCTG GTGGGCCTAC
TGCGAACTCG GCTGGGGCGG CTGGTGGTTC TGGGATCCGG TGGAAAACGC CTCTTTATTA
CCCTGGCTTT CTGCCACTGC GCTGCTGCAC AGTTTATCCC TGACACGCCA GCGGGGGATT
TTTCGCCACT GGTCGCTGTT GTTGGCGATA GTTACTCTGA TGCTGTCGCT GCTGGGCACC
TTAATTGTCC GTTCTGGCAT TCTGGTTTCG GTTCATGCGT TCGCTCTGGA TAACGTTCGC
GCTGTGCCGT TGTTCAGCCT GTTTGCACTG ATTAGCCTTG CGTCTCTGGC TCTGTATGGC
TGGCGAGCGC GGGACGGTGG CCCGGCGGTG CGTTTTTCGG GGTTATCGCG GGAAATGTTA
ATCCTCGCTA CGCTGTTGCT GTTTTGCGCA GTGCTACTGA TCGTGCTGGT GGGAACGCTT
TATCCGATGA TTTACGGCCT GCTGGGCTGG GGACGTCTCT CCGTTGGCGC ACCGTATTTT
AACCGCGCGA CGTTACCGTT TGGTCTGTTG ATGTTGGTGG TGATTGTGCT GGCGACGTTT
GTCTCTGGCA AACGCGCGCA GCTTCCGGCG CTGCTGGCGC ATGCGGGCAT ACTGTTATTT
GCCGCAGGGA TCGTTGTCTC CAGCGTCAGC CGTCAGGAGA TCAGCCTGAA TTTACAGCCG
GGTCAGCAGG TGACGCTGGC AGGATACACC TTCCGTTTTG AGCGCCTCGA TCTGCAAGCC
AAAGGCAATT ACACCAGCGA AAAAGCGATA GTGGCGCTGT TTGACCATCA GCAACGCATT
GGTGAATTAA CGCCGGAGCG GCGTTTTTAC GAAGCTCGTC GTCAGCAAAT GATGGAACCG
TCAATTCGCT GGAACGGCAT CCATGACTGG TATGCGGTCA TGGGGGAGAA AACTGGAGCG
GATCGTTACG CTTTTCGCTT GTATGTACAA AGCGGTGTGC GCTGGATCTG GGGGGGAGGA
TTGTTGATGA TTGCGGGCGC ATTGTTAAGC GGATGGCGGG GGAAGAAGCG CGATGTATAA
 
Protein sequence
MLSLGVNVLT PLTAFAGVRL RWPAMMRLTC IGILAQFALL LLAFGVLTYC FLISDFSVIY 
VAQHSYSLLS WKLKLAAVWG GHEGSLLLWV LLLSAWSALF AWQYRQQTDP LFPLTLAVLS
LMLAALLLFV VLWSDPFVRI FPPAIEGRDL NPMLQHPGLI FHPPLLYLGY GGLMVAASVA
LASLLRGEFD GACARICWRW ALPGWSALTA GIILGSWWAY CELGWGGWWF WDPVENASLL
PWLSATALLH SLSLTRQRGI FRHWSLLLAI VTLMLSLLGT LIVRSGILVS VHAFALDNVR
AVPLFSLFAL ISLASLALYG WRARDGGPAV RFSGLSREML ILATLLLFCA VLLIVLVGTL
YPMIYGLLGW GRLSVGAPYF NRATLPFGLL MLVVIVLATF VSGKRAQLPA LLAHAGILLF
AAGIVVSSVS RQEISLNLQP GQQVTLAGYT FRFERLDLQA KGNYTSEKAI VALFDHQQRI
GELTPERRFY EARRQQMMEP SIRWNGIHDW YAVMGEKTGA DRYAFRLYVQ SGVRWIWGGG
LLMIAGALLS GWRGKKRDV