Gene Haur_0105 details


Gene Information

Locus tag: Haur_0105
Symbol:
ID: 5731998
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 137897
End bp: 139744
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 52%
IMG OID: 641277227
Product: cytochrome c oxidase subunit I type
Protein accession: YP_001542885
Protein GI: 159896638
COG category: [C] Energy production and conversion
COG ID: [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1
TIGRFAM ID: [TIGR02891] cytochrome c oxidase, subunit I
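
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and chosen only for illustration.

```python
# Illustrative helpers; the names are hypothetical and not part of any database API.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(137897, 139744)
print(length)                  # 1848
print(protein_length(length))  # 615
```

Under these assumptions the computed values match the Gene Length (1848 bp) and Protein Length (615 aa) fields.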


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACTG ACGTAAGTAC CCTCCATCAC GCCCCAGCTA AGCAACGGCG GGGTTGGTTG 
GAATGGATCA CCACTGTCGA TCATAAAAAA ATTGGCATCA TGTATGCTGT GACTGCCTTT
TTATTCTTTG TGATCGGTGG GCTTGAGGCG TTGCTCATCC GTTTGCAGCT TGGTACGCCC
CAAAATACGC TGCTGACACC CGAAGCTTAT AACCAAGTCT TTACGATGCA CGGCACAACC
ATGGTGTTTA TGGCGATTAT GCCTGTCAAC GCTGGGTTTA TGAACTATTT TGTGCCCTTG
ATGATTGGCG CAGGCGATAT GGCCTACCCG CGCATGAACG CCATGAGCTA TTGGCTCTTG
TTGTTCGGCG GGATTGTCAT GTATTCCAGC TTTGTGTTCG GCGGCGGCGC TCCCGATGCT
GGCTGGTTTG CTTATGCTCC ACTCACCTCA ACCACCTACT CGGTTACCCG AGGGATGGAT
TATTGGGTGC TAGGTTTGCA GCTGCTGGGG GTTTCCTCGC TGGCTGGCTC GGTCAACATC
ATCGTAACAA TCATCAGGCT ACGAGCACCT GGCATGCGCT TTAATCGGAT GCCGCTGTTC
GTTTGGATGA GCTTCGTTAC CTCATTCTTG TTGATTTTTG CCTTGCCAAG CATCACGGTT
GGGATCACCT TGCTGTTCTT TGACCGCAAC TTTGGTACCA ACTTCTTCTT GCCTGCTGCT
GGCGGCGACC CACTGTTGTG GCAACATTTG TTCTGGTTCT TCGGTCACCC CGAAGTGTAC
ATTATGATTC TGCCCGCCTT CGGTGTGGTT TCAGAAATGT TGCCAGTTTT CTCACGCAAG
CCGATTTTCG GTTACGAGTT TGTGGCCTAC TCTGGGGTTG CAATCGGGGT GTTGGGCTTC
ACCGTGTGGG CACACCACAT GTTTGCCACC AACTTGGGCG TGATTGCCGA CACCTTCTTT
GCTGCCGCTT CGATGTTGAT TTCTGTCCCG ACGGGGGTCA AAATCTTCAA CTGGCTAGCA
ACCTTGTGGC GCGGTGAGTT GCGCTTCAAA ACCCCAATGC TGTTCAGCCT TGGCTTTATC
GCCATGTTCG TGATCGGCGG GATTTCCGGG GTTTCGTTGG CGGCTGCGCC GTTCGACTTA
CAAGTTACCG ATAGCTACTT TGTGGTTGCT CACTTCCACT ATGTGCTGTT TGGTGGCGCT
GTCTTTGCCT TGTTTGGTGC ATCATACTAT TGGTTCCCCA AAATTACTGG CAAGATGATG
AGCGAACGCA TCGGCAAATG GCACTTCTGG ATTTTAATGC TGGGCTTTAA TTTGACCTTC
TTCCCGCAAC ATATGCTTGG GCTGCAAGGT ATGCCACGGC GGGTCTGGAC GTATCAACAA
AACCAAGGCT GGGATTTCTA CAACTTGCTT TCAACGATTG GGGCATTTTG TATCGCCTTG
GGTACGTTGG TCTTTATCGT CAACTTTATT ATCAGCTTGC GCAAGGGCGC TCCTGCGGGC
AACGACCCAT GGGATGCCGC AACCCTGGAA TGGGCGACCA CCTCGCCACC ACCAGCCCAC
AACTTCGATG TGGAATACAT TGTGCATAGC CGTCGCCCCT TGTGGGATAA CAAATATAGC
GGCGAAGGTC GAGGCATGAC GATCAATTAT GATTTCCATC CCCACTTGCC ACCGCCATCG
TTTGCCCCAA TCATCTTCTC GTTTGGCTTG TTTGTGCTAG CTTATGGCAT GCTCAACCTC
GCCTCTGCGC CATTGATTGG GATTCCATTG ATCTTGGTGG CCTTGGCAAT TGCCTTCGTT
GGCATGAACC GCTGGGTCGG CGAAATTGCC CAAGATCCGG TGCTGTAA
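
The gene sequence is laid out in blocks of 10 bases, so it needs whitespace stripped before any length or GC calculation. The following is a minimal sketch of that clean-up and of a GC fraction computed as (G + C) / total bases; the helper names are hypothetical, and only the first line of the sequence is used as sample input, so its GC value is not expected to equal the whole-gene figure.

```python
# Illustrative helpers; the names are hypothetical.

def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used as sample input only.
first_line = "ATGGCAACTG ACGTAAGTAC CCTCCATCAC GCCCCAGCTA AGCAACGGCG GGGTTGGTTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{100 * gc_fraction(seq):.1f}% GC")
```

Passing the full sequence text to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field.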
 
Protein sequence
MATDVSTLHH APAKQRRGWL EWITTVDHKK IGIMYAVTAF LFFVIGGLEA LLIRLQLGTP 
QNTLLTPEAY NQVFTMHGTT MVFMAIMPVN AGFMNYFVPL MIGAGDMAYP RMNAMSYWLL
LFGGIVMYSS FVFGGGAPDA GWFAYAPLTS TTYSVTRGMD YWVLGLQLLG VSSLAGSVNI
IVTIIRLRAP GMRFNRMPLF VWMSFVTSFL LIFALPSITV GITLLFFDRN FGTNFFLPAA
GGDPLLWQHL FWFFGHPEVY IMILPAFGVV SEMLPVFSRK PIFGYEFVAY SGVAIGVLGF
TVWAHHMFAT NLGVIADTFF AAASMLISVP TGVKIFNWLA TLWRGELRFK TPMLFSLGFI
AMFVIGGISG VSLAAAPFDL QVTDSYFVVA HFHYVLFGGA VFALFGASYY WFPKITGKMM
SERIGKWHFW ILMLGFNLTF FPQHMLGLQG MPRRVWTYQQ NQGWDFYNLL STIGAFCIAL
GTLVFIVNFI ISLRKGAPAG NDPWDAATLE WATTSPPPAH NFDVEYIVHS RRPLWDNKYS
GEGRGMTINY DFHPHLPPPS FAPIIFSFGL FVLAYGMLNL ASAPLIGIPL ILVALAIAFV
GMNRWVGEIA QDPVL
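
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a hand-rolled translator, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and that reading begins at the first base. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Minimal codon-wise translation; a sketch under the assumptions stated above.

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # 'X' for unrecognized codons
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence (20 codons).
first_line = "ATGGCAACTG ACGTAAGTAC CCTCCATCAC GCCCCAGCTA AGCAACGGCG GGGTTGGTTG"
print(translate(first_line))  # MATDVSTLHHAPAKQRRGWL
```

The output matches the first 20 residues of the protein sequence listed above (MATDVSTLHH APAKQRRGWL).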