Gene EcSMS35_4536 details


Gene Information

Locus tag: EcSMS35_4536
Symbol: nrfE
ID: 6146496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4637546
End bp: 4639228
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 57%
IMG OID: 641619352
Product: heme lyase subunit NrfE
Protein accession: YP_001746464
Protein GI: 170682982
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1138] Cytochrome c biogenesis factor
TIGRFAM ID: [TIGR00353] c-type cytochrome biogenesis protein CcmF
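
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminating stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4637546
end_bp = 4639228

# Assumption: both coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not
# counted in the reported protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1683 560
```

Under these assumptions the computed values agree with the reported Gene Length (1683 bp) and Protein Length (560 aa).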


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTTGTTAA GTCTCGGGGT CAACGTGTTG ACCCCGTTGA CGGCCTTTGC GGGAGTGCGG 
TTGCGCTGGC CTGCCATGAT GCGACTCACT TGCATCGGCA TTCTGGCGCA GTTCGCGCTC
CTGCTGCTCG CCTTTGGCGT ACTGACGTAT TGTTTTCTCA TCAGCGATTT CTCGGTCATT
TATGTCGCCC AACATAGCTA CAGCCTGCTG TCGTGGGAAC TCAAACTGGC GGCGGTGTGG
GGCGGTCATG AAGGTTCGCT GCTGCTTTGG GTGCTGCTGC TTTCCGCCTG GAGCGCGCTG
TTTGCCTGGC ATTATCGGCA GCAAACCGAT CCGCTATTTC CGCTGACGCT AGCCGTTTTA
TCTCTCATGC TCGCCGCACT GCTACTGTTT GTAGTGCTGT GGTCCGATCC CTTCGTGCGG
ATATTTCCGC CAGCAATCGA AGGCCGCGAT CTCAATCCGA TGCTGCAACA TCCCGGTCTT
ATCTTTCATC CACCGCTGCT TTATCTCGGC TACGGCGGTT TGATGGTGGC GGCGAGCGTG
GCGCTGGCGA GTTTACTGCG CGGCGAGTTT GATGGTACCT GCGCCCGAAT TTGCTGGCGC
TGGGCACTAC CAGGCTGGGG GGCATTAACG GCGGGGATCA TCCTCGGTTC CTGGTGGGCC
TATTGCGAAC TGGGCTGGGG CGGCTGGTGG TTCTGGGATC CGGTGGAAAA CGCCTCTTTA
TTACCCTGGC TTTCTGCCAC TGCGCTGCTG CACAGTTTGT CCCTGACACG CCAGCGGGGG
ATTTTCCGCC ACTGGTCGCT GTTACTGGCG ATAGTTACTC TGATGCTGTC GCTGCTGGGC
ACCTTAATTG TCCGTTCTGG CATTCTGGTT TCGGTTCATG CGTTCGCGCT GGATAACGTT
CGCGCCGTGC CGTTGTTCAG CCTGTTTGCA CTGATTAGCC TTGCGTCTCT GACTCTGTAT
GGCTGGCGAG CGCGGGACGG TGGCCCGGCG GTGCGTTTTT CGGGGTTATC GCGGGAAATG
TTAATCCTCG CTACGCTGTT GCTGTTTTGC GCAGTGCTAC TGATCGTGCT GGTGGGAACG
CTTTATCCGA TGATTTACGG CCTGCTGGGC TGGGGACGCC TCTCCGTTGG CGCGCCGTAT
TTTAACCGTG CGACGTTACC GTTTGGTCTG TTGATGCTGG TGGTGATTGT GCTGGCGACG
TTTGTCTCTG GCAAACGCGT GCAGCTTCCG GCGCTGCTGG CGCATGCGGG CGTGCTGTTA
TTTGCCGCAG GGATCGTGGT CTCCAGCGTC AGCCGTCAGG AGATCAGCCT GAATTTACAG
CCGGGTCAGC AGGTGACGCT GGCAGGATAC GCCTTCCGTT TTGAGCGTCT CGATCTGCAA
GCCAGAGGCA ATTACACCAG CGAAAAAGCG ATAGTGGCAC TGTTTGACCA TCAGCAACGT
ATTGGTGAAC TGACGCCGGA GCGGCGTTTT TATGAAGCAC GCCGTCAGCA AATGATGGAA
CCGTCAATTC GCTGGAACGG CATCCATGAC TGGTATGCGG TCATGGGGGA GAAAACTGGG
CCGGATCGTT ACGCTTTTCG TTTGTATGTA CAAAGCGGTG TGCGCTGGAT CTGGGGGGGA
GGATTGTTGA TGATTGCGGG CGCATTGTTA AGCGGATGGC GGGGGAGGAA GCGCGATGAA
TAA
 
Protein sequence
MLLSLGVNVL TPLTAFAGVR LRWPAMMRLT CIGILAQFAL LLLAFGVLTY CFLISDFSVI 
YVAQHSYSLL SWELKLAAVW GGHEGSLLLW VLLLSAWSAL FAWHYRQQTD PLFPLTLAVL
SLMLAALLLF VVLWSDPFVR IFPPAIEGRD LNPMLQHPGL IFHPPLLYLG YGGLMVAASV
ALASLLRGEF DGTCARICWR WALPGWGALT AGIILGSWWA YCELGWGGWW FWDPVENASL
LPWLSATALL HSLSLTRQRG IFRHWSLLLA IVTLMLSLLG TLIVRSGILV SVHAFALDNV
RAVPLFSLFA LISLASLTLY GWRARDGGPA VRFSGLSREM LILATLLLFC AVLLIVLVGT
LYPMIYGLLG WGRLSVGAPY FNRATLPFGL LMLVVIVLAT FVSGKRVQLP ALLAHAGVLL
FAAGIVVSSV SRQEISLNLQ PGQQVTLAGY AFRFERLDLQ ARGNYTSEKA IVALFDHQQR
IGELTPERRF YEARRQQMME PSIRWNGIHD WYAVMGEKTG PDRYAFRLYV QSGVRWIWGG
GLLMIAGALL SGWRGRKRDE
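
The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is among the permitted initiation codons and an initiation codon is read as methionine. The sketch below illustrates the relationship on the first three 10-base blocks of the gene sequence. It is a hedged illustration, not the tool used to produce this record: `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical names, and the start codon set is stated here as an assumption based on the NCBI definition of table 11.

```python
# Standard amino acid assignments, ordered by codon with bases in T, C, A, G order.
# Table 11 uses the same assignments as the standard code for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: initiation codons of translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(blocks: str) -> str:
    """Translate a coding sequence given as whitespace-separated blocks."""
    dna = "".join(blocks.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3       # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")            # an initiation codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":              # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


first_blocks = "TTGTTGTTAA GTCTCGGGGT CAACGTGTTG"
print(translate_cds(first_blocks))  # prints: MLLSLGVNVL
```

The output matches the first block of the protein sequence above. Note that only the first TTG is read as M; the second TTG is an ordinary leucine codon.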