Gene Information |
Locus tag | Msed_0291 |
Symbol | |
ID | 5104927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 247618 |
End bp | 249306 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506197 |
Product | cytochrome b/b6 domain-containing protein |
Protein accession | YP_001190392 |
Protein GI | 146303076 |
COG category | [C] Energy production and conversion |
COG ID | [COG1290] Cytochrome b subunit of the bc complex |
TIGRFAM ID | |
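The length fields in this table can be cross-checked against the coordinates. The sketch below is only a consistency check, and it assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 247618
END_BP = 249306


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1689
print(protein_length(length_bp))  # 562
```

Both results agree with the Gene Length and Protein Length rows listed above.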
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0120651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.455263 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGAAA CTTCACAGGA AAAAAAGAAG GGTCTAATAG ATCAAATTCT TGATAGGGTA GGGGTAACTG AGGCTCCGTT CTTTAAGACC CCCGACTACA TGTATAACAT TTCCTACTGG CTTGGTGCCA TGGTCTCAGC TGCTTTTATC TACACTGTAA TCACCGGGCT CTTCCTTTTA CTTTACTATA TGCCAGCTAA TCCTTACCCT CAGACACAGG CGATAATTAA TACTGTGCCC TATGGGTCGG TCATCCTTTT CAGTCACCTT TATGGAGCTT ACATCATGAT CATCCTAGCG TATATCCACA TGTTTAGGAA TTTCTATAAG GGAGCTTATA AAAAGCCTAG GGAACTGCAA TGGGTAACTG GAGTTTTGTT ACTCGCTCTG ACCATGGGTG CTTCATTCTT TGGCTATAGC CTCGTGAGCG ACGTACTGGG AGTCAATGCA ATAAACATCG GGGAAAGCCT TCTTGTGGGA ACTGGGTTCC CAGGTGCTTC CACTATTGCA AATTGGCTAT TTGGACCCGG AGGTGACGCA GCTACAGCCA GTAACCCACT GGTTAAATCG CAGCTATTTG ATAGGCTTCT TGGCTGGCAC ATCATCATGG TGTTCCTGAT AGGCCTTCTA TTTGGTGTTC ACTTCCTCAT GTCAGAGAGA TACGGTATGA CTCCCTCTGC TAAGGAAAAG CCCAAGGTTC CAGCATATTA CACAAAGGAG GAGTGGTCTA AGTTCAACGA GTGGTGGCCC AGGAACGTGG TTTACATGCT ATCAATCGTG CTGATGACCT GGGGAATTAT CCTGTTTATC CCAGATCTAC TAGCTAACAT CAACGGCCTA CCCATTGTGA TCAATCCTTA CCCTGCACCT GAACCCGGGA CCGCAGCAGC TCTTTCCACA CAACCCTACC CACCTTGGTT CTTCCTATTT CTATACAAGT TTGTGGACTT TGAGCTTCCC AATGGACAGG CCATGAGTCC AGCTCAGGCA CTTTCAATAC TAGTGGTTAT TCTAATGGTG CTGATTCTAA TGCCCTTCTT TGAGAACAGC GAATACATGT TCCTAAGGAA CAGGAAGTTC TGGACGTGGA TAATGACCGT AGCGTGGGTG TCCCTCATAG AGTTAAGCGT ATGGGGATAC CTAGCTCCAG GAGTTCCAGC TCCAACTAGC CAACAGGTGG AGATTCTCGG AATCCCAGCG GTAATAATAG GTCTTGTTAT TCTTGCTACA GGTAGACAAA AAAGCTCAAA GTCAGCACCT TCTCTTAATG CACCAGAAAC TGTTCCAAAA ATTGGACCAA CATCAATCCT TGGAACTGCA ATAGCCTCGC TACTGTTCGC CGGCAGTTTC GGAGTTTGGT TAATGCACCC TGTCATGATT AACATGATAA TGATGATACC CTTTGGGGCA TTAGCTGTTT ACATGGTGTA TAGGATGGCC TCTGGAATGA GGGTTAAAGT TGCCAAGCCC GCGGGTACAA TGACATGGGA GGAGGTTAAG TTTAGGAAAA CAATTGCCTT GTTTGCCCTT CCAGCAATTC TAGTGGTAAC AGCTATACAG ACTGCAATAA TGTGGAAACT TCCAAGTGTG GGACCACAGG CTACCTACGC TGGAATGGAT CTAGGTATAC TCCTGTTCCT ATGGGGTGTG GCCATCCAGC TTTATCATTA CATAGTTTAT GTTAGGTGA
|
Protein sequence | MTETSQEKKK GLIDQILDRV GVTEAPFFKT PDYMYNISYW LGAMVSAAFI YTVITGLFLL LYYMPANPYP QTQAIINTVP YGSVILFSHL YGAYIMIILA YIHMFRNFYK GAYKKPRELQ WVTGVLLLAL TMGASFFGYS LVSDVLGVNA INIGESLLVG TGFPGASTIA NWLFGPGGDA ATASNPLVKS QLFDRLLGWH IIMVFLIGLL FGVHFLMSER YGMTPSAKEK PKVPAYYTKE EWSKFNEWWP RNVVYMLSIV LMTWGIILFI PDLLANINGL PIVINPYPAP EPGTAAALST QPYPPWFFLF LYKFVDFELP NGQAMSPAQA LSILVVILMV LILMPFFENS EYMFLRNRKF WTWIMTVAWV SLIELSVWGY LAPGVPAPTS QQVEILGIPA VIIGLVILAT GRQKSSKSAP SLNAPETVPK IGPTSILGTA IASLLFAGSF GVWLMHPVMI NMIMMIPFGA LAVYMVYRMA SGMRVKVAKP AGTMTWEEVK FRKTIALFAL PAILVVTAIQ TAIMWKLPSV GPQATYAGMD LGILLFLWGV AIQLYHYIVY VR
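The two sequence rows can be checked against each other and against the GC content field. The following is a minimal sketch, assuming the sequences are pasted in as shown (blocks of ten characters separated by spaces). The helper names `clean`, `gc_fraction` and `translate` are hypothetical. Translation table 11 uses the same codon assignments as the standard code for elongation; its alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGACTGAAA CTTCACAGGA AAAAAAGAAG"))  # MTETSQEKKK
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein sequence and the GC content field to be compared in the same way.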