Gene EcolC_3953 details


Gene Information

Locus tag: EcolC_3953
Symbol:
ID: 6064469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4340679
End bp: 4342301
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 57%
IMG OID: 641603366
Product: heme lyase subunit NrfE
Protein accession: YP_001726881
Protein GI: 170021927
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1138] Cytochrome c biogenesis factor
TIGRFAM ID: [TIGR00353] c-type cytochrome biogenesis protein CcmF
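
The start, end, gene length, and protein length fields above are mutually consistent if the coordinates are read as 1-based and inclusive and the coding sequence ends in a single stop codon. Both readings are assumptions, not something the record states. A minimal sketch of that check follows; the function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Length in aa, assuming an unspliced CDS whose length is a multiple of 3."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    # The terminal stop codon does not contribute a residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(4340679, 4342301)
print(length_bp, "bp")                    # compare with the Gene Length field
print(protein_length(length_bp), "aa")    # compare with the Protein Length field
```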


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGCTGGC CTGCCATGAT GCGACTTACT TGCATCGGCA TTCTGGCGCA GTTCGCGCTC 
CTGCTGCTCG CCTTTGGCGT ACTGACGTAT TGTTTTCTCA TCAGCGATTT CTCGGTCATT
TATGTCGCCC AACATAGCTA CAGCCTGCTG TCGTGGGAAC TCAAACTGGC GGCGGTATGG
GGTGGTCATG AAGGTTCGCT GCTGCTTTGG GTTCTGCTGC TTTCCGCCTG GAGCACGCTG
TTTGCCTGGC ATTATCGGCA GCAAACCGAT CCGCTATTTC CGCTGACGCT AGCCGTTTTA
TCTCTCATGC TCGCCGCACT GCTACTGTTT GTAGTGCTGT GGTCCGATCC CTTCGTGCGG
ATATTTCCAC CAGCAATCGA AGGCCGCGAT CTCAATCCGA TGCTGCAACA TCCCGGTCTT
ATCTTTCATC CACCGCTGCT TTACCTTGGC TATGGCGGTT TGATGGTAGC GGCGAGCGTG
GCGCTGGCGA GTTTACTGCG CAGCGAGTTT GATGGTGCCT GCGCCCGAAT TTGCTGGCGC
TGGGCACTAC CTGGCTGGAG CGCATTAACG GCGGGGATCA TTCTCGGTTC CTGGTGGGCC
TATTGCGAAC TGGGCTGGGG CGGCTGGTGG TTCTGGGATC CGGTGGAAAA CGCCTCTTTA
TTACCCTGGC TTTCTGCCAC TGCGCTGCTG CACAGTTTGT CCCTGACACG CCAGCGGGGG
ATTTTCCGCC ACTGGTCGCT GTTGCTGGCG ATAGTTACTC TGATGCTGTC GCTGCTGGGC
ACCTTAATTG TCCGTTCTGG CATTCTGGTT TCGGTTCATG CGTTCGCGCT GGATAACGTC
CGCGCCGTGC CGTTGTTCAG CCTGTTTGCA CTGATTAGCC TTGCGTCTCT GGCTCTGTAT
GGCTGGCGAG CGCGGGACGG TGGCCCGGCG GTGCGTTTTT CGGGGTTATC GCGGGAAATG
TTAATCCTCG CTACGCTGTT GCTGTTTTGC GCAGTGCTAC TGATCGTGCT GGTGGGAACT
CTTTATCCGA TGATTTACGG TCTGCTGGGC TGGGGACGCC TCTCCGTTGG CGCGCCGTAT
TTTAACCGCG CGACGTTACC GTTTGGTCTG TTGATGCTGG TGGTGATTGT GCTGGCGACG
TTTGTCTCTG GCAAACGCGT GCAGCTTCCG GCGCTGGTAG CTCATGCTGG CGTGCTGTTA
TTTGCCGCGG GGATCGTGGT TTCCAGCGTC AGTCGTCAGG AGATCAGCCT CAATTTACAG
CCGGGTCAGC AGGTGACGCT GGCAGGATAC ACCTTCCGTT TTGAGCGACT CGATCTGCAA
GCCAGAGGCA ATTACACCAG CGAAAAAGCG ATAGTGGCAC TGTTTGACCA TCAGCAACGT
ATTGGTGAAC TGACGCCGGA GCGGCGTTTT TATGAAGCAC GCCGTCAGCA AATGATGGAA
CCGTCAATTC GCTGGAACGG CATCCATGAC TGGTATGCGG TCATGGGGGA GAAAACTGGG
CCGGATCGTT ACGCTTTTCG TTTGTATGTA CAAAGCGGTG TGCGCTGGAT CTGGGGGGGA
GGATTGTTGA TGATTGCAGG CGCATTATTA AGCGGATGGC GGGGGAGGAA GCGCGATGAA
TAA
 
Protein sequence
MRWPAMMRLT CIGILAQFAL LLLAFGVLTY CFLISDFSVI YVAQHSYSLL SWELKLAAVW 
GGHEGSLLLW VLLLSAWSTL FAWHYRQQTD PLFPLTLAVL SLMLAALLLF VVLWSDPFVR
IFPPAIEGRD LNPMLQHPGL IFHPPLLYLG YGGLMVAASV ALASLLRSEF DGACARICWR
WALPGWSALT AGIILGSWWA YCELGWGGWW FWDPVENASL LPWLSATALL HSLSLTRQRG
IFRHWSLLLA IVTLMLSLLG TLIVRSGILV SVHAFALDNV RAVPLFSLFA LISLASLALY
GWRARDGGPA VRFSGLSREM LILATLLLFC AVLLIVLVGT LYPMIYGLLG WGRLSVGAPY
FNRATLPFGL LMLVVIVLAT FVSGKRVQLP ALVAHAGVLL FAAGIVVSSV SRQEISLNLQ
PGQQVTLAGY TFRFERLDLQ ARGNYTSEKA IVALFDHQQR IGELTPERRF YEARRQQMME
PSIRWNGIHD WYAVMGEKTG PDRYAFRLYV QSGVRWIWGG GLLMIAGALL SGWRGRKRDE
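
The gene sequence starts with TTG while the protein sequence starts with M. That is expected under translation table 11, where several alternative start codons are read as methionine in the initiator position. The sketch below shows one way to reproduce the translation and the GC content from the blocked sequence text. The codon table is built from the NCBI layout for table 11 (identical to the standard code for amino acids), the start codon set is the one listed for that table, and the helper names are hypothetical.

```python
import itertools

# Codons in NCBI order (T, C, A, G at each position) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}
# Codons that translation table 11 allows as initiators (read as M at position 0).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to block the sequence; upper-case it."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is always read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, copied with their original spacing.
print(translate("TTGCGCTGGC CTGCCATGAT GCGACTTACT"))
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the Protein sequence block and the GC content field.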