Gene Haur_1059 details


Gene Information

Locus tag: Haur_1059
Symbol:
ID: 5732963
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1207830
End bp: 1212137
Gene Length: 4308 bp
Protein Length: 1435 aa
Translation table: 11
GC content: 53%
IMG OID: 641278194
Product: cobaltochelatase, CobN subunit
Protein accession: YP_001543835
Protein GI: 159897588
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1429] Cobalamin biosynthesis protein CobN and related Mg-chelatases
TIGRFAM ID: [TIGR02257] cobaltochelatase, CobN subunit
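
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
# Sketch: consistency check for the coordinate and length fields of this record.
# Assumptions: coordinates are 1-based and inclusive; the last codon is a stop
# codon and is not counted as an amino acid.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length in base pairs of a feature with inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Amino acid count for a CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Gene Information fields above.
length = gene_length_bp(1207830, 1212137)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

For this record the computed values agree with the listed Gene Length (4308 bp) and Protein Length (1435 aa).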


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.792006
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGAAC GTTTGCAACG CAATATGGTC GAGCGCATCG ATGGACGCAT GGTCAACGCT 
GGTTATAAAC GCGGCATGTT GTTTATCTGC GCTCATGGCT GCTGCTGTGG GATTACTGAG
CGTGGTTTTG CCCCAGTTTG GCCGGAAATT CACCAGCATG AATGGGAAAG CCGCAAGCTG
CGCAATATTG TGCATCTGAA TATGGGCGGT TGTTTGGGTC CATGTCCCTT GGCGAATGTG
GCCATGTTGA TGTTTGATGG CCAGCCGATT TGGTTTCATT CGCTCAACCA ACCAGAGTTG
GTGGTAGCGC TCTACGACTA CATCGAAGCC ATGATTCAGG CCGATCAGGT GCTGCCTGCG
CCAGCCAATC TTAAGCCCCA TGTCTTTAAT GGCTTTAAAT GGGATGGCCA AGCACCCGCC
GCAATCAGCC CAGCGCCAGC GTTACCAGCC AGCCAGCCCA GCCTTGGCAA TGGCATCATA
TTTTTGACCC AAGCGGATAC CGACATCCTG TTAATTGAGC AAGCGCGTCG CCAATTGCCC
AGCGATTTTG CCAATGTGCA AACCGCCAAC GTGGGCGAGG CCGAAGATCA AACTGCGCTT
GATGGCTTAT TTGATCACTT GTTGCCCCAA GCGGAAATTG TGGTTGTGCG TTTGATTACG
GGCAGTCGAG GCTTTGAGCA TGGCATCGAG CGCTTGCAGC AATGGACACA ACAAACCAAT
GGCTTTGTGC TGTATTTGCC TGGCTTTGAG GCGCTTGACC CTGAGTTGAT GGCGCTCTCG
AATGTTGGCG TGCCGCTGGC GCACTTAATC AGCAGCTATT TTATGCAGGG CGGCTTGGAA
AATGTCGTCA ATGCCTTGGC CTGTTTGAGC GACCACTTAT TGCTCACCGG TTGGGGCTAC
GACGCGCCAC AGGAATTACC CCGCCATGGC ATCTACACGC CAGCGTTAAA TCGCTGCGAA
GGCTGTGCCA CTGCCCAATG CCAAACCAGT GCGCAAGCCG ATAAGCCAAC CGTGGGCGTG
CTGTTTTATC GGGCGCACAT GCTGAGCGGG AACACCGATT TTGTTGATGC AATTTGTCAG
GCGATCGCAG CTCAGGGCAT GCAACCACGG GCGGTCTATA CCCACTCGCT CAAGGAAGGC
GGCAGCAACA ATCTGCCCGC CGCCTTGGAA TGTTTGCAAG CCGATGGGCC AATTGATGCC
TTGATTTGCA GTTTGTCGTT TGCTTTGGGC AATGTTAATA CCGAAGGCCC AACCCTCGCT
GGCGCGGAAA TCGATATGCT ACAACAACTG AATGTGCCGA TTATCCAAGC GATTGCGAGT
GGCCGCTCAC GTGAGCAATG GCAACGAGCA GGTCGTGGAT TAAGCTCGTT GGATACAGCG
ATCAATGTGG CGATTCCTGA ATTTGATGGC CGAATTATCA CCGTGCCAAT CTCGTTCAAG
GAGCAAGACC CCAATTCTGG CGGCGTGCGC TATATGCCAG ATCTTGAGCG GGTTGAGCGA
GTGGTCGGCA TTACGCGGCG CTTAGTCAAT TTGCGCCACA AACCCAACGC TGAAAAACGC
ATAGCTTTTG TGTTTACCAA CAGCAGCGCC AAAGCCTCGC GGGTTGGCAA TGCGGTCGGC
TTGGATGCGC CTGCTTCATT ATTAACCCTA CTCCACGCCA TGCAAGCCGA GGGCTATCAG
GTTGGCAAAT TGCCCGCTAG CAGCGATCAA CTCTTGTTCG ATTTGATCGA TCGTTGTTCG
TATGACGAAA CCTGGCTGAC TGAGCAACAA TTAGCTCAGG CGGTGCATGT ACCAGTCGAT
CAATATCAAC AATGGTTTGC TGAATTGCCC GCCAATTTGC AAGCCAGCAT GATCAAACAA
TGGGGCGAAG CGCCTGGCAC GGCCTACCTG ACCGAGCAAG GGCTAGCCTT GGCGGGCTTA
GAGTTTGGCA ACATCTTTGT GGCTTTGCAG CCGCCGCGTG GCTATGGCAT GGACCCAAAT
GCAATTTACC ATATGCCCGA TTTGCCGCCG CCCCACAGCT ATTATGCGCT CTACCGCTGG
CTACGCGATG GTTGGCAGGC CGATACGCTG GTGCATATGG GCAAACATGG CACGCTGGAA
TGGCTGCCGG GCAAGAGCGT TGGCCTCAGC CGCGAGTGCT TCCCCGATGC TTTAATTGGC
GATTTGCCTG TAATTTATCC GTTTATCATC AATGATCCAG GCGAGGGCAA TCAGGCCAAA
CGCCGTAGCC ATGCCGTGAT CATCGACCAT ATGACTCCGC CGATGACCAG TGCTGGAGCC
TATGGCCAGT TGGCCGAATT GGCGCAGTTG GTCAATGAAT ATTATCAAGT TGAGCAACTC
GACCCCAATA AATTGCCCTT GTTGCAGCGC CAAATTTGGA ACTTGCTGCA AACCAGCAAT
CTCAGCGACG ATTTGCAATT TATTCTCAAG GCCAACCACG GCGACCACAC CCACGATTGG
GATGGCTCGT TCCTCGAAGA TGGCACGCCC ACAGCCTTTG CCGAGATGGA AGGCCGTCAA
GTCGCCCACT TGCTGGAAGA TATTGAAGGC TATTTGTGCG AACTAACCGG AGCGCAAATT
CGCGATGGCT TGCACATTTT AGGAACCTTG GCCGAAGGCG ACCAACTGCC AGAATTGCTG
TTTCACCTGA CCAAACTACC CAACCTTGAT GTGCCCAGTT TACCAGTGGC AGTAGGCAAA
CTCTACGGCT TGGATTGGAA TAATTTGCAA GCCAATTTGG GCGAGCGCTT AGCTCAGCCG
TTGCAATTGA CCGACCAAAC GCTTTACAGC AATGGCGATG TGGCGGCTTG GATCGAACAA
CATTGCAAAA CAATTTTACA TACCTTGGAG TTGCAAGCTT GGCAAGTTAG CGCAATTCCG
GCGGCGTTGC AAGCGAGTTT ATTGCCTAGC GCAGCAGCTT GGGATCAAAC GGTCGTCGCG
CCCTTGGAAT TTATCTGCAC CCAACTTATC CCTAACTTGC ATGAAAGTGC GCGAGCTGAA
ATTGCCAGCC TGCTGCATAG CTTGAATGGT GGCTATATTC CGGCGGGGCC AAGTGGAGCG
CCAACTCGCG GTATGGCTCA TGTGCTGCCA ACAGGCCGCA ATTTCTATGC TGTCGATCCG
CGTTCGTTGC CTAGTGCGGC GGCGTGGCAA GTTGGCCAGC ATTTGGCCGA TGATCTGATT
CGGCGCTATC AACGGGAAGA AGGCAGCTAC CCGCGCAGCG TTGGCATTAG CATTTGGGGC
ACTAGCGCCA TGCGCACCTA CGGCGATGAT ATTGCCCAAG TGCTGGCCTT ATTGGGCGTG
CGGCCAGTGT GGCAAGCTGA AAACCGCCGC ATCACGGGCG TTGAAGTAAT TCCCTTGGCC
GAGCTTGGCC GCCCACGGAT TAATGTGGTT TGTCGAATTA GCGGCTTCTT CCGCGATGCC
TTCCCACATT TGGTCAGTTT GCTTGACCAA GCCGCCCAAA CCGTGATTGA ACTTGATGAA
CCACTTGATC AGAATTTTGC TCGTCAGCAG GCCTTTGCGG CCACTGAGCA GCTAATTAAA
AACGGCTTAG CGCCCGAAAT TGCCCAACAA CAAGCGCGTT ATCGGGTTTG GGGCTGCAAG
CCAGGCAGTT ATGGCGCAGG CATTTTGCCC TTGATTGATA GCCAAAATTG GCAAGATGAC
GCAGATTTTG CCCGAGCCTA CATCGCTTGG GGTGGTTATG CCTACACCAG CGACGACTAT
GGGATTGAGG CTGCCGATGC CTTTGGCACA GCTTTGAGTG AAGTTCAAGT TGCCACCAAA
AATCAAGATA ATCGCGAACA CGATATTTTT GATAGCGACG ACTATATGCA ATATCACGGC
GGCATGATCG CTACGATTCG CGCCTTGACT GGACGCAACC CACGCCGCTA CTTTGGCGAT
TCGAGCAACC CACAGCGCCC GCAAACCCGC GATTTGCGCG AAGAAGCGCG ACGGGTATTT
CGTAGTCGGG TGGTTAATCC CAAATGGATC GAAAGCATGC GCCGCCATGG CTACAAGGGC
GCATTGGAGC TAGCTGCCAC CGTCGATTAT CTGTTTGGCT ACGATGCCAC GGCTCAAGTT
GTTGATGATT GGATGTATCA CTCGATTGCT GAGCACTATT TGCGCGATGA AGAGATGCAA
CAATTTTTTG CCGATAGCAA CCCGTGGGCT TGGCAAGCGA TTGCTGAACG CCTACTCGAA
GCGATTGATC GCGATTTATG GCACGACCCT GAACAACACG ATATCGATCT GCTCAAAGCC
GCCCAATCGT TGGGTTTGAG CAATTTGCAA CAACGAGGCC AGCTATGA
 
Protein sequence
MTERLQRNMV ERIDGRMVNA GYKRGMLFIC AHGCCCGITE RGFAPVWPEI HQHEWESRKL 
RNIVHLNMGG CLGPCPLANV AMLMFDGQPI WFHSLNQPEL VVALYDYIEA MIQADQVLPA
PANLKPHVFN GFKWDGQAPA AISPAPALPA SQPSLGNGII FLTQADTDIL LIEQARRQLP
SDFANVQTAN VGEAEDQTAL DGLFDHLLPQ AEIVVVRLIT GSRGFEHGIE RLQQWTQQTN
GFVLYLPGFE ALDPELMALS NVGVPLAHLI SSYFMQGGLE NVVNALACLS DHLLLTGWGY
DAPQELPRHG IYTPALNRCE GCATAQCQTS AQADKPTVGV LFYRAHMLSG NTDFVDAICQ
AIAAQGMQPR AVYTHSLKEG GSNNLPAALE CLQADGPIDA LICSLSFALG NVNTEGPTLA
GAEIDMLQQL NVPIIQAIAS GRSREQWQRA GRGLSSLDTA INVAIPEFDG RIITVPISFK
EQDPNSGGVR YMPDLERVER VVGITRRLVN LRHKPNAEKR IAFVFTNSSA KASRVGNAVG
LDAPASLLTL LHAMQAEGYQ VGKLPASSDQ LLFDLIDRCS YDETWLTEQQ LAQAVHVPVD
QYQQWFAELP ANLQASMIKQ WGEAPGTAYL TEQGLALAGL EFGNIFVALQ PPRGYGMDPN
AIYHMPDLPP PHSYYALYRW LRDGWQADTL VHMGKHGTLE WLPGKSVGLS RECFPDALIG
DLPVIYPFII NDPGEGNQAK RRSHAVIIDH MTPPMTSAGA YGQLAELAQL VNEYYQVEQL
DPNKLPLLQR QIWNLLQTSN LSDDLQFILK ANHGDHTHDW DGSFLEDGTP TAFAEMEGRQ
VAHLLEDIEG YLCELTGAQI RDGLHILGTL AEGDQLPELL FHLTKLPNLD VPSLPVAVGK
LYGLDWNNLQ ANLGERLAQP LQLTDQTLYS NGDVAAWIEQ HCKTILHTLE LQAWQVSAIP
AALQASLLPS AAAWDQTVVA PLEFICTQLI PNLHESARAE IASLLHSLNG GYIPAGPSGA
PTRGMAHVLP TGRNFYAVDP RSLPSAAAWQ VGQHLADDLI RRYQREEGSY PRSVGISIWG
TSAMRTYGDD IAQVLALLGV RPVWQAENRR ITGVEVIPLA ELGRPRINVV CRISGFFRDA
FPHLVSLLDQ AAQTVIELDE PLDQNFARQQ AFAATEQLIK NGLAPEIAQQ QARYRVWGCK
PGSYGAGILP LIDSQNWQDD ADFARAYIAW GGYAYTSDDY GIEAADAFGT ALSEVQVATK
NQDNREHDIF DSDDYMQYHG GMIATIRALT GRNPRRYFGD SSNPQRPQTR DLREEARRVF
RSRVVNPKWI ESMRRHGYKG ALELAATVDY LFGYDATAQV VDDWMYHSIA EHYLRDEEMQ
QFFADSNPWA WQAIAERLLE AIDRDLWHDP EQHDIDLLKA AQSLGLSNLQ QRGQL