Gene Haur_4252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4252 
Symbol 
ID5736106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5425281 
End bp5426969 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content50% 
IMG OID641281407 
Productcytochrome b/b6 domain-containing protein 
Protein accessionYP_001547012 
Protein GI159900765 
COG category[C] Energy production and conversion 
COG ID[COG1290] Cytochrome b subunit of the bc complex 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000252204 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTAT CGCCTGAAAG CAAGCGCAAG GGCTTCGGTG GCTGGCTAAC CGAGGTTGGT 
CGCTCATTCT TCCCTTCGAT GGGGGCACGT GAGTGGCGCG AAACGCTCCG TGGTGAGCCA
GCACCACGCC CAAACCCACG GATGCGGGTG CATTCAAATA GTTTTTGGTA CCACATTCGC
CCACGTTCGT TATCTGAAGA AGCCACCGCA TGGTATTACA CTATGGGGCT TGGCTGGATG
TCGTTTTTCT TCTTTGTGCT TGAGGCGATT ACTGGTCTCG TTTTGATGAT CTATTATTCG
CCATCGCCCA ATGAAGCCTA CGCCACTATG ACTCAGATTA TGAACGATGT GCCGTTGGGC
GGCCTGATGC GGAATGTTCA CCGCTTAGGG GCGCACTTTA TGGTTGCAGT GGTGATTTTA
CACATGCTGC GAACCTACTT TACGGCCTCG TATAAAGCTC CGCGCCAGTT TATCTGGTTT
ACGGGGATGA TCCTGCTCTT TATGACGCTG TTGTTGTCGT TCTCTGGCTA TCTCTTGCCA
TGGGACCAAT TGGCGTTCTG GGCGGTGACG ATTGGTTCGT CGATGGCCGA TGCTGCACCT
GGGGTTGGGC CGGCAATTGG CCGCTTGCTC CGTGGTGGCG CTGAAATTGG CGCGGGCGCA
CTCTTGCGCT TCTATTTGCT GCACATCTTT ATGTTGCCAA TGCTGACGAT TATTTTCATC
AGTATTCACT ACTATGCAGT GCGCAAGCAA GAAATCTCGC CAATTCACGA ATTGTTTGAA
AACAAAAAAC CAACCAAGCG CAAAATCCCC TTCTTGCCAG ATCAAGTGTT CTTTGAATTG
GCCGTGATTG TGGTGTTGAC CTTTGCCTTT ATCTTTATCA ACAACTTCTT CTGGGATGCC
AAGCTGGAAA ATCACGCCAA CGCTTTGGAA ACCCCTCAAC ACACCCAAGC ACCATGGTAT
TTCTTCTGGT TGCAAGGGAT GTTGAAGCTT GGTGATAAGA TCGTTTGGGG CTTGGGCATC
GCTGGGATCA TCTTCGGCGC ACTGTTCCTC TTGCCATACA TCGACCGCAA CCCTTCACGC
CGCTTCAAAG ATCGCAAATT TGCGCTTGCT GGCGGGATCG TTTCGTTGAT TGTCTTTATT
GTGGTTTCGT ATGGTGGCTT GCCCGCCTTC GGGATTCAAA AAGTCGGTTC GAACGAATTG
GCCGTGTCGT ATGTACCGGT TGAAGGCGAA GGTCGAGTGA TGGAAGTGCC ATTCGACCAA
GTGCCGCAAG AAAAATTTGT CTATAAAGTC TATTACGATG CAACCAAAGA TGCGTTTGTC
GATGGTGAGT TTGGCGTAGC CGAAGGGCCA TTGCCAGAAG CACTCTCGCC CGTCTTCAAA
GAAATGTTGC TCGAACTCAA GCACGATGTG CAAAAATGGG CTGAATATGA TGTGTTGTTT
GTTCGCCCAA CCGTAACCTT GACGATCGAG CCATGGCTCT ATCAACAAGA TACCGATGCT
GCCGGATTCT CAACAGCAGT CGATGGGATT CTGCAAAAAC GTGTGACCTT GGATATGGAA
TGGACAACGG CGGGTTACGA TGCCGAAGGT AATTTGGTTG AAACCCCTGA AAAGAGCCGT
TACACCCAAT ATAAATTCCT GAACCGCAAT GGGGTTGTTC ACGTTGGCGA TACAGAACCA
CGAAACTAA
 
Protein sequence
MAVSPESKRK GFGGWLTEVG RSFFPSMGAR EWRETLRGEP APRPNPRMRV HSNSFWYHIR 
PRSLSEEATA WYYTMGLGWM SFFFFVLEAI TGLVLMIYYS PSPNEAYATM TQIMNDVPLG
GLMRNVHRLG AHFMVAVVIL HMLRTYFTAS YKAPRQFIWF TGMILLFMTL LLSFSGYLLP
WDQLAFWAVT IGSSMADAAP GVGPAIGRLL RGGAEIGAGA LLRFYLLHIF MLPMLTIIFI
SIHYYAVRKQ EISPIHELFE NKKPTKRKIP FLPDQVFFEL AVIVVLTFAF IFINNFFWDA
KLENHANALE TPQHTQAPWY FFWLQGMLKL GDKIVWGLGI AGIIFGALFL LPYIDRNPSR
RFKDRKFALA GGIVSLIVFI VVSYGGLPAF GIQKVGSNEL AVSYVPVEGE GRVMEVPFDQ
VPQEKFVYKV YYDATKDAFV DGEFGVAEGP LPEALSPVFK EMLLELKHDV QKWAEYDVLF
VRPTVTLTIE PWLYQQDTDA AGFSTAVDGI LQKRVTLDME WTTAGYDAEG NLVETPEKSR
YTQYKFLNRN GVVHVGDTEP RN