Gene Information |
Locus tag | EcHS_A2332 |
Symbol | |
ID | 5591289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2332337 |
End bp | 2333389 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640921458 |
Product | cytochrome c-type biogenesis family protein |
Protein accession | YP_001458993 |
Protein GI | 157161675 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3088] Uncharacterized protein involved in biosynthesis of c-type cytochromes; [COG4235] Cytochrome c biogenesis factor |
TIGRFAM ID | |
|
|
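The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2332337
end_bp = 2333389
gene_length_bp = 1053
protein_length_aa = 350

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = span // 3
residues = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("codons minus stop matches protein length:", residues == protein_length_aa)
```

Under these assumptions all three checks print True: 2333389 - 2332337 + 1 = 1053 bp, which is 351 codons, or 350 residues plus a stop codon.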
Plasmid Coverage information |
Num covering plasmid clones | 60 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTTTT TATTGGGCGT GCTGATGCTG ATGATCTCCG GCTCAGCACT GGCGACCATC GACGTGTTGC AGTTTAAAGA TGAAGCACAG GAACAACAGT TCCGTCAGCT CACTGAAGAA CTGCGCTGCC CGAAATGCCA GAACAACAGC ATTGCCGATT CCAACTCGAT GATTGCCACC GACCTGCGTC AGAAAGTGTA TGAACTGATG CAGGAAGGTA AAAGTAAAAA AGAGATTGTC GATTATATGG TGGCGCGTTA CGGCAACTTC GTCACTTACG ATCCGCCGTT AACGCCGCTG ACCGTGCTGC TGTGGGTGCT TCCGGTAGTG GCTATTGGCA TTGGCGGTTG GGTCATATAC GCCCGCTCGC GGCGTCGGGT ACGCGTAGTG CCGGAAGCGT TTCCTGAACA AAGCGTGCAG GAAGGTAAGC GTGCCGGATA TATTGTTTAT CTGCCGGGTA TTGTGGTGGC GTTAATTGTG GCTGGCGTCA GCTACTACCA GACTGGCAAT TATCAGCAGG TGAAAATCTG GCAGCAGGCC ACGGCACAGG CTCCGGCGTT GCTGGACAGG GCGCTGGATC CGAAAACCGA TCCGCTCAAC GAAGAAGAGA TGTCGCGTCT TGCGCTGGGG ATGCGTACTC AACTGCAAAA AAATCCGGGA GATATAGAAG GCTGGATTAT GTTGGGCCGC GTTGGCATGG CGCTGGGTAA CGCTAGTATC GCCACCGATG CATACGCTAC TGCGTATCGC CTCGATCCGA AAAACAGCGA TGCCGCACTG GGTTATGCTG AAGCGTTGAC ACGTTCATCT GATCCCAACG ACAACCGCCT CGGTGGTGAA CTGCTGCGCC AGTTGGTGAG AAGCGACCAC AGCAATATCC GTGTGTTAAG CATGTATGCG TTTAATGCCT TTGAGCAGCA GCGATTTGGC GAAGCCGTTG CCGCGTGGGA GATGATGTTG AAACTCTTAC CTGCCAACGA TACTCGCCGT GCGGTGATTG AACGTAGTAT CGCGCAGGCG ATGCAACATT TGTCGCCGCA GGAGAGTAAA TAA
|
Protein sequence | MRFLLGVLML MISGSALATI DVLQFKDEAQ EQQFRQLTEE LRCPKCQNNS IADSNSMIAT DLRQKVYELM QEGKSKKEIV DYMVARYGNF VTYDPPLTPL TVLLWVLPVV AIGIGGWVIY ARSRRRVRVV PEAFPEQSVQ EGKRAGYIVY LPGIVVALIV AGVSYYQTGN YQQVKIWQQA TAQAPALLDR ALDPKTDPLN EEEMSRLALG MRTQLQKNPG DIEGWIMLGR VGMALGNASI ATDAYATAYR LDPKNSDAAL GYAEALTRSS DPNDNRLGGE LLRQLVRSDH SNIRVLSMYA FNAFEQQRFG EAVAAWEMML KLLPANDTRR AVIERSIAQA MQHLSPQESK
|
| |
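The sequence fields above are printed in space-separated groups of ten characters. A minimal sketch for joining such a field and running basic sanity checks on it might look like the following. The function names are hypothetical helpers, not part of any database API, and the stop-codon set assumes the standard stops used by translation table 11.

```python
def clean_sequence(grouped: str) -> str:
    """Join a whitespace-grouped sequence field into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(dna: str) -> bool:
    """Check that the length is a multiple of 3 and the last codon is a stop.

    Assumption: TAA, TAG and TGA are the stop codons (translation table 11).
    The start codon is not checked, because table 11 permits alternative starts.
    """
    seq = clean_sequence(dna)
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in {"TAA", "TAG", "TGA"}


# Example on the first two groups of the gene sequence only.
first_groups = "ATGAGGTTTT TATTGGGCGT"
print(clean_sequence(first_groups))
print(gc_percent(first_groups))
```

Passing the full Gene sequence value to `gc_percent` gives a figure that can be compared with the GC content field in the record, and `looks_like_complete_cds` gives a quick check that the field was copied without truncation.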