Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2342 |
Symbol | |
ID | 6143936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2375643 |
End bp | 2376695 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617215 |
Product | cytochrome c-type biogenesis family protein |
Protein accession | YP_001744387 |
Protein GI | 170680448 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3088] Uncharacterized protein involved in biosynthesis of c-type cytochromes [COG4235] Cytochrome c biogenesis factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.397636 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTTTT TATTGGGCGT GCTGATGCTG ATGATCTCCG GCTCAGCGCT GGCGACCATC GACGTGTTGC AGTTTAAGGA TGAAGCGCAG GAGCAGCAGT TCCGCCAACT CACTGAAGAA CTACGCTGCC CGAAATGCCA GAACAACAGC ATTGCCGATT CCAACTCGAT GATTGCCACC GACCTGCGCC AGAAAGTGTA TGAACTGATG CAGGAAGGTA AAAGTAAGAA AGAGATTGTC GATTATATGG TGGCGCGTTA CGGCAACTTC GTCACTTACG ATCCGCCGTT AACGCCGCTG ACCGTGCTGC TGTGGGTGCT TCCGGTAGTG GCTATTGGCA TTGGCGGTTG GGTCATTTAC GCCCGTTCGC GGCGTCGGGT ACGCGTGGTG CCGGACGCGT TTCCTGAACA AAGCGTGCCG GAAGGTAAGC GTGCCGGATA TATTGTTTAT CTGCCGGGTA TTGTGGTGGC GTTAATTGTG GCTGGCGTCA GCTACTACCA GACGGGCAAT TATCAGCAGG TGAAAATCTG GCAGCAGGCC ACGGCACAGG CTCCGGCGTT ACTGGACAGG GCGCTGGATC CGAAAGCCGA TCCGCTCAAC GAAGAAGAGA TGTCGCGCCT GGCGCTGGGG ATGCGTACTC AACTGCAAAA AAATCCGGGA GATATAGAAG GCTGGATTAT GTTGGGCCGC GTTGGCATGG CGCTGGGTAA CGCCAGTATT GCCACCGATG CATACGCTAC TGCATATCGC CTCGATCCGA AGAACAGTGA TGCGGCGCTC GGATACGCTG AAGCGTTGAC ACGTTCATCT GATCCCAACG ACAACCGCCT GGGTGGTGAA CTGCTACGCC AGTTGGTGAG AACTGACCAC AGCAATATCC GTGTGTTAAG CATGTATGCG TTTAATGCCT TTGAGCAGCA GCGATTTGGC GAAGCCGTTG CCGCGTGGGA GATGATGTTG AAACTCTTAC CTGCCAATGA TACTCGCCGT GCGGTGATTG AACGTAGTAT CGCGCAGGCG ATGCAACATT TGTCGCCGCA GGAGAGTAAA TAA
|
Protein sequence | MRFLLGVLML MISGSALATI DVLQFKDEAQ EQQFRQLTEE LRCPKCQNNS IADSNSMIAT DLRQKVYELM QEGKSKKEIV DYMVARYGNF VTYDPPLTPL TVLLWVLPVV AIGIGGWVIY ARSRRRVRVV PDAFPEQSVP EGKRAGYIVY LPGIVVALIV AGVSYYQTGN YQQVKIWQQA TAQAPALLDR ALDPKADPLN EEEMSRLALG MRTQLQKNPG DIEGWIMLGR VGMALGNASI ATDAYATAYR LDPKNSDAAL GYAEALTRSS DPNDNRLGGE LLRQLVRTDH SNIRVLSMYA FNAFEQQRFG EAVAAWEMML KLLPANDTRR AVIERSIAQA MQHLSPQESK
|
| |