Gene Information |
Locus tag | Franean1_1810 |
Symbol | |
ID | 5670212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2172224 |
End bp | 2173108 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641240731 |
Product | cytochrome c class I |
Protein accession | YP_001506154 |
Protein GI | 158313646 |
COG category | [C] Energy production and conversion |
COG ID | [COG2010] Cytochrome c, mono- and diheme variants |
TIGRFAM ID | [TIGR00782] cytochrome c oxidase, cbb3-type, subunit III |
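
The coordinate and length fields above agree with each other, which a short check can confirm. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the Gene Length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Franean1_1810.
length = gene_length_bp(2172224, 2173108)
print(length)                      # 885
print(protein_length_aa(length))   # 294
```

Both results match the Gene Length (885 bp) and Protein Length (294 aa) fields listed in the record.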
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.780394 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.101949 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAT CCAAGGCATC CGGAGCCGCG GCGTCCCCGA CCTCGTCCGC GGGCGAGGCC GGCAGACCCC GGCTCATCTC GGTGAGGAAC GCCGGGCGAT CCCGGTCGAC CTCCGCGCGC CGCCGACGGC GGTCGTCGTC CATCGTTCTT CTCCTCGGCC TGCTGGCGAC GGGGGTGATG TGGACGGTGC TCGCTCCAGG CGGCAACGCG ACGGAGACGC CGGACGCCAA CGAGGCCGTG CGCCAGGGCC GCGCCCTGTT CCTGCAGGGT TGCGCGACCT GCCACGGGCT GAACGCCCAG GGCTCGACCG AGGGCCCGAG CCTGATCGGC GTCGGCGCGG CCGCGGTGGA CTTCCAGATG TCCACCGGCC GCATGCCGCT CGCCGCGCCG GCGGCCCAGG CCGACCGCAA GCCCCCGTCG TACTCGGAGA CCCAGATCGA CCAGATCGCG GCCTACGTCC AGACCCTGGG CGGCGGTACG CAGGTGCCCG AGCTGACGGA CGCGGATCTC AACGACGCCG ACCTCGCCGA GGGCGGTGAG CTCTTCCTCG CGAACTGCGC CCAGTGCCAC CAGGCCGCCG GCGCCGGCGC CCCGCTCACC TACGGCAAGT TCGCGCCGTC GCTGAGCCAG GCCACGCCCG AACAGGTCGT CGAGGCGATG CGGACCGGCC CGGAGTCGAT GCCGGTGTTC GGTCCCGGTC AGATCAACGA TGACGAGGCC GTCGCGGTCG CCGCGTACGT CCGTCACCTC CAGGACACCC CGTCACCCGG CGGTTCCTCG ATCGGTAAGT ACGGGCCGGT CCCGGAGGGC CTGGTCGCGT GGGTGATAGG CATCGGCGCG CTGCTCGGCG TCTGCCTCTG GATCGGAGCG AGGCAGAAGC TGTGA
|
Protein sequence | MTTSKASGAA ASPTSSAGEA GRPRLISVRN AGRSRSTSAR RRRRSSSIVL LLGLLATGVM WTVLAPGGNA TETPDANEAV RQGRALFLQG CATCHGLNAQ GSTEGPSLIG VGAAAVDFQM STGRMPLAAP AAQADRKPPS YSETQIDQIA AYVQTLGGGT QVPELTDADL NDADLAEGGE LFLANCAQCH QAAGAGAPLT YGKFAPSLSQ ATPEQVVEAM RTGPESMPVF GPGQINDDEA VAVAAYVRHL QDTPSPGGSS IGKYGPVPEG LVAWVIGIGA LLGVCLWIGA RQKL
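
The sequences above are printed in blocks of ten characters separated by spaces, so the spaces have to be stripped before any analysis. The following is a minimal sketch of that step together with a simple GC-fraction calculation. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full 885 bp.

```python
def normalize_sequence(grouped: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already normalized sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown in the record.
fragment = normalize_sequence("ATGACCACAT CCAAGGCATC")
print(len(fragment))                    # 20
print(f"{gc_fraction(fragment):.0%}")   # 50%
```

Passing the complete gene sequence through the same two functions gives a value that can be compared with the GC content field in the record.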