Gene Information |
Locus tag | Haur_4252 |
Symbol | |
ID | 5736106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5425281 |
End bp | 5426969 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281407 |
Product | cytochrome b/b6 domain-containing protein |
Protein accession | YP_001547012 |
Protein GI | 159900765 |
COG category | [C] Energy production and conversion |
COG ID | [COG1290] Cytochrome b subunit of the bc complex |
TIGRFAM ID | |
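The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is the convention that reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5425281
end_bp = 5426969

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length rows of the table.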
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000252204 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGTAT CGCCTGAAAG CAAGCGCAAG GGCTTCGGTG GCTGGCTAAC CGAGGTTGGT CGCTCATTCT TCCCTTCGAT GGGGGCACGT GAGTGGCGCG AAACGCTCCG TGGTGAGCCA GCACCACGCC CAAACCCACG GATGCGGGTG CATTCAAATA GTTTTTGGTA CCACATTCGC CCACGTTCGT TATCTGAAGA AGCCACCGCA TGGTATTACA CTATGGGGCT TGGCTGGATG TCGTTTTTCT TCTTTGTGCT TGAGGCGATT ACTGGTCTCG TTTTGATGAT CTATTATTCG CCATCGCCCA ATGAAGCCTA CGCCACTATG ACTCAGATTA TGAACGATGT GCCGTTGGGC GGCCTGATGC GGAATGTTCA CCGCTTAGGG GCGCACTTTA TGGTTGCAGT GGTGATTTTA CACATGCTGC GAACCTACTT TACGGCCTCG TATAAAGCTC CGCGCCAGTT TATCTGGTTT ACGGGGATGA TCCTGCTCTT TATGACGCTG TTGTTGTCGT TCTCTGGCTA TCTCTTGCCA TGGGACCAAT TGGCGTTCTG GGCGGTGACG ATTGGTTCGT CGATGGCCGA TGCTGCACCT GGGGTTGGGC CGGCAATTGG CCGCTTGCTC CGTGGTGGCG CTGAAATTGG CGCGGGCGCA CTCTTGCGCT TCTATTTGCT GCACATCTTT ATGTTGCCAA TGCTGACGAT TATTTTCATC AGTATTCACT ACTATGCAGT GCGCAAGCAA GAAATCTCGC CAATTCACGA ATTGTTTGAA AACAAAAAAC CAACCAAGCG CAAAATCCCC TTCTTGCCAG ATCAAGTGTT CTTTGAATTG GCCGTGATTG TGGTGTTGAC CTTTGCCTTT ATCTTTATCA ACAACTTCTT CTGGGATGCC AAGCTGGAAA ATCACGCCAA CGCTTTGGAA ACCCCTCAAC ACACCCAAGC ACCATGGTAT TTCTTCTGGT TGCAAGGGAT GTTGAAGCTT GGTGATAAGA TCGTTTGGGG CTTGGGCATC GCTGGGATCA TCTTCGGCGC ACTGTTCCTC TTGCCATACA TCGACCGCAA CCCTTCACGC CGCTTCAAAG ATCGCAAATT TGCGCTTGCT GGCGGGATCG TTTCGTTGAT TGTCTTTATT GTGGTTTCGT ATGGTGGCTT GCCCGCCTTC GGGATTCAAA AAGTCGGTTC GAACGAATTG GCCGTGTCGT ATGTACCGGT TGAAGGCGAA GGTCGAGTGA TGGAAGTGCC ATTCGACCAA GTGCCGCAAG AAAAATTTGT CTATAAAGTC TATTACGATG CAACCAAAGA TGCGTTTGTC GATGGTGAGT TTGGCGTAGC CGAAGGGCCA TTGCCAGAAG CACTCTCGCC CGTCTTCAAA GAAATGTTGC TCGAACTCAA GCACGATGTG CAAAAATGGG CTGAATATGA TGTGTTGTTT GTTCGCCCAA CCGTAACCTT GACGATCGAG CCATGGCTCT ATCAACAAGA TACCGATGCT GCCGGATTCT CAACAGCAGT CGATGGGATT CTGCAAAAAC GTGTGACCTT GGATATGGAA TGGACAACGG CGGGTTACGA TGCCGAAGGT AATTTGGTTG AAACCCCTGA AAAGAGCCGT TACACCCAAT ATAAATTCCT GAACCGCAAT GGGGTTGTTC ACGTTGGCGA TACAGAACCA CGAAACTAA
Protein sequence | MAVSPESKRK GFGGWLTEVG RSFFPSMGAR EWRETLRGEP APRPNPRMRV HSNSFWYHIR PRSLSEEATA WYYTMGLGWM SFFFFVLEAI TGLVLMIYYS PSPNEAYATM TQIMNDVPLG GLMRNVHRLG AHFMVAVVIL HMLRTYFTAS YKAPRQFIWF TGMILLFMTL LLSFSGYLLP WDQLAFWAVT IGSSMADAAP GVGPAIGRLL RGGAEIGAGA LLRFYLLHIF MLPMLTIIFI SIHYYAVRKQ EISPIHELFE NKKPTKRKIP FLPDQVFFEL AVIVVLTFAF IFINNFFWDA KLENHANALE TPQHTQAPWY FFWLQGMLKL GDKIVWGLGI AGIIFGALFL LPYIDRNPSR RFKDRKFALA GGIVSLIVFI VVSYGGLPAF GIQKVGSNEL AVSYVPVEGE GRVMEVPFDQ VPQEKFVYKV YYDATKDAFV DGEFGVAEGP LPEALSPVFK EMLLELKHDV QKWAEYDVLF VRPTVTLTIE PWLYQQDTDA AGFSTAVDGI LQKRVTLDME WTTAGYDAEG NLVETPEKSR YTQYKFLNRN GVVHVGDTEP RN
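The relationship between the two sequences can be illustrated by translating the ends of the gene sequence. This is a rough sketch, assuming the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relying on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code. The `translate` helper and the variable names are hypothetical and not part of any library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 shares
# these assignments with the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Fragments copied from the Gene sequence row: first 30 nt and last 9 nt.
first_30_nt = "ATGGCTGTAT CGCCTGAAAG CAAGCGCAAG"
last_9_nt = "CGAAACTAA"

n_terminus = translate(first_30_nt)
c_terminus = translate(last_9_nt)
print(n_terminus, c_terminus)
```

The first fragment should correspond to the opening residues of the protein sequence, and the last fragment to its final two residues followed by the stop codon.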