Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0747 |
Symbol | |
ID | 6269949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 703746 |
End bp | 704798 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641724933 |
Product | cytochrome c-type biogenesis family protein |
Protein accession | YP_001879461 |
Protein GI | 187730638 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3088] Uncharacterized protein involved in biosynthesis of c-type cytochromes [COG4235] Cytochrome c biogenesis factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.238814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTTTT TATTGGGCGT GCTGATGCTG ATGATCTCCG GCTCAGCGCT GGCGACCATC GACGTGTTGC AGTTTAAGGA TGAAGCGCAG GAGCAGCAGT TCCGCCAACT CACTGAAGAA CTACGCTGCC CGAAATGCCA GAACAACAGC ATTGCCGATT CCAACTCGAT GATTGCCACC GACCTGCGCC AGAAAGTGTA TGAACTGATG CAGGAAGGTA AAAGTAAAAA AGAGATTGTC GATTATATGG TGGCGCGTTA CGGCAACTTC GTCACTTACG ATCCGCCGTT AACGCCGCTG ACCGTGCTGC TGTGGGTGCT TCCGGTAGTG GCTATTGGCA TTGGCGGTTG GGTCATTTAC GCCCGTTCGC GGCGTCGGGT ACGCGTGGTG CCGGACGCGT TTCCTGAACA AAGCGTGCCG GAAGGTAAGC GTGCCGGATA TATTGTTTAT CTGCCGGGTA TTGTGGTGGC GTTAATTGTG GCTGGCGTCA GCTACTACCA GACTGGCAAT TATCAGCAGG TGAAAATCTG GCAGCAGGCC ACGGCACAGG CTCCGGCGTT GCTGGACAGG GCGCTGGATC CGAAAGCCGA TCCGCTCAAC GAAGAAGAGA TGTCGCGTCT TGCGCTGGGG ATGCGTACTC AACTGCAAAA AAATCCGGGA GATATAGAAG GCTGGATTAT GTTGGGCCGC GTTGGCATGG CGCTGGGTAA CGCCAGTATC GCCACCGATG CATACGCTAC TGCGTATCGC CTCGATCCGA AAAACAGTGA TGCAGCACTG GGTTATGCTG AAGCGTTGAC ACGTTCATCT GATCCCAACG ACAACCGCCT CGGTGGTGAA CTGCTGCGCC AGTTGGTGAG AACGGACCAT AGCAATATCC GTGTGTTAAG CATGTATGCG TTTAATGCCT TTGAGCAGCA GCGATTTGGC GAAGCCGTTG CCGCGTGGGA GATGATGTTG AAACTCTTAT CTGCCAACGA TACTCGCCGT GCGGTGATTG AACGTAGTAT CGCGCAGGCG ATGCAACATT TGTCGCCGCA GGAGAGTAAA TAA
|
Protein sequence | MRFLLGVLML MISGSALATI DVLQFKDEAQ EQQFRQLTEE LRCPKCQNNS IADSNSMIAT DLRQKVYELM QEGKSKKEIV DYMVARYGNF VTYDPPLTPL TVLLWVLPVV AIGIGGWVIY ARSRRRVRVV PDAFPEQSVP EGKRAGYIVY LPGIVVALIV AGVSYYQTGN YQQVKIWQQA TAQAPALLDR ALDPKADPLN EEEMSRLALG MRTQLQKNPG DIEGWIMLGR VGMALGNASI ATDAYATAYR LDPKNSDAAL GYAEALTRSS DPNDNRLGGE LLRQLVRTDH SNIRVLSMYA FNAFEQQRFG EAVAAWEMML KLLSANDTRR AVIERSIAQA MQHLSPQESK
|
| |