Gene Msed_0291 details


Gene Information

Locus tag: Msed_0291
Symbol:
ID: 5104927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 247618
End bp: 249306
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 46%
IMG OID: 640506197
Product: cytochrome b/b6 domain-containing protein
Protein accession: YP_001190392
Protein GI: 146303076
COG category: [C] Energy production and conversion
COG ID: [COG1290] Cytochrome b subunit of the bc complex
TIGRFAM ID:
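
The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(247618, 249306)
print(length)                  # 1689
print(protein_length(length))  # 562
```

Both results match the Gene Length and Protein Length fields of this record.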


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0120651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.455263
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAAA CTTCACAGGA AAAAAAGAAG GGTCTAATAG ATCAAATTCT TGATAGGGTA 
GGGGTAACTG AGGCTCCGTT CTTTAAGACC CCCGACTACA TGTATAACAT TTCCTACTGG
CTTGGTGCCA TGGTCTCAGC TGCTTTTATC TACACTGTAA TCACCGGGCT CTTCCTTTTA
CTTTACTATA TGCCAGCTAA TCCTTACCCT CAGACACAGG CGATAATTAA TACTGTGCCC
TATGGGTCGG TCATCCTTTT CAGTCACCTT TATGGAGCTT ACATCATGAT CATCCTAGCG
TATATCCACA TGTTTAGGAA TTTCTATAAG GGAGCTTATA AAAAGCCTAG GGAACTGCAA
TGGGTAACTG GAGTTTTGTT ACTCGCTCTG ACCATGGGTG CTTCATTCTT TGGCTATAGC
CTCGTGAGCG ACGTACTGGG AGTCAATGCA ATAAACATCG GGGAAAGCCT TCTTGTGGGA
ACTGGGTTCC CAGGTGCTTC CACTATTGCA AATTGGCTAT TTGGACCCGG AGGTGACGCA
GCTACAGCCA GTAACCCACT GGTTAAATCG CAGCTATTTG ATAGGCTTCT TGGCTGGCAC
ATCATCATGG TGTTCCTGAT AGGCCTTCTA TTTGGTGTTC ACTTCCTCAT GTCAGAGAGA
TACGGTATGA CTCCCTCTGC TAAGGAAAAG CCCAAGGTTC CAGCATATTA CACAAAGGAG
GAGTGGTCTA AGTTCAACGA GTGGTGGCCC AGGAACGTGG TTTACATGCT ATCAATCGTG
CTGATGACCT GGGGAATTAT CCTGTTTATC CCAGATCTAC TAGCTAACAT CAACGGCCTA
CCCATTGTGA TCAATCCTTA CCCTGCACCT GAACCCGGGA CCGCAGCAGC TCTTTCCACA
CAACCCTACC CACCTTGGTT CTTCCTATTT CTATACAAGT TTGTGGACTT TGAGCTTCCC
AATGGACAGG CCATGAGTCC AGCTCAGGCA CTTTCAATAC TAGTGGTTAT TCTAATGGTG
CTGATTCTAA TGCCCTTCTT TGAGAACAGC GAATACATGT TCCTAAGGAA CAGGAAGTTC
TGGACGTGGA TAATGACCGT AGCGTGGGTG TCCCTCATAG AGTTAAGCGT ATGGGGATAC
CTAGCTCCAG GAGTTCCAGC TCCAACTAGC CAACAGGTGG AGATTCTCGG AATCCCAGCG
GTAATAATAG GTCTTGTTAT TCTTGCTACA GGTAGACAAA AAAGCTCAAA GTCAGCACCT
TCTCTTAATG CACCAGAAAC TGTTCCAAAA ATTGGACCAA CATCAATCCT TGGAACTGCA
ATAGCCTCGC TACTGTTCGC CGGCAGTTTC GGAGTTTGGT TAATGCACCC TGTCATGATT
AACATGATAA TGATGATACC CTTTGGGGCA TTAGCTGTTT ACATGGTGTA TAGGATGGCC
TCTGGAATGA GGGTTAAAGT TGCCAAGCCC GCGGGTACAA TGACATGGGA GGAGGTTAAG
TTTAGGAAAA CAATTGCCTT GTTTGCCCTT CCAGCAATTC TAGTGGTAAC AGCTATACAG
ACTGCAATAA TGTGGAAACT TCCAAGTGTG GGACCACAGG CTACCTACGC TGGAATGGAT
CTAGGTATAC TCCTGTTCCT ATGGGGTGTG GCCATCCAGC TTTATCATTA CATAGTTTAT
GTTAGGTGA
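
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation such as the GC content reported above. The following is a minimal sketch: `clean_sequence` and `gc_percent` are hypothetical helper names, and the usage runs only on the first line of the sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGACTGAAA CTTCACAGGA AAAAAAGAAG GGTCTAATAG ATCAAATTCT TGATAGGGTA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full sequence text to the same two functions would give a value to compare against the GC content field.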
 
Protein sequence
MTETSQEKKK GLIDQILDRV GVTEAPFFKT PDYMYNISYW LGAMVSAAFI YTVITGLFLL 
LYYMPANPYP QTQAIINTVP YGSVILFSHL YGAYIMIILA YIHMFRNFYK GAYKKPRELQ
WVTGVLLLAL TMGASFFGYS LVSDVLGVNA INIGESLLVG TGFPGASTIA NWLFGPGGDA
ATASNPLVKS QLFDRLLGWH IIMVFLIGLL FGVHFLMSER YGMTPSAKEK PKVPAYYTKE
EWSKFNEWWP RNVVYMLSIV LMTWGIILFI PDLLANINGL PIVINPYPAP EPGTAAALST
QPYPPWFFLF LYKFVDFELP NGQAMSPAQA LSILVVILMV LILMPFFENS EYMFLRNRKF
WTWIMTVAWV SLIELSVWGY LAPGVPAPTS QQVEILGIPA VIIGLVILAT GRQKSSKSAP
SLNAPETVPK IGPTSILGTA IASLLFAGSF GVWLMHPVMI NMIMMIPFGA LAVYMVYRMA
SGMRVKVAKP AGTMTWEEVK FRKTIALFAL PAILVVTAIQ TAIMWKLPSV GPQATYAGMD
LGILLFLWGV AIQLYHYIVY VR