Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1001 |
Symbol | |
ID | 5732904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1145734 |
End bp | 1147008 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278135 |
Product | putative oxygen-independent coproporphyrinogen III oxidase |
Protein accession | YP_001543777 |
Protein GI | 159897530 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACAC ATGCAATCAA ACATCTTTAC GTTCATATAC CATTTTGCCA AACCCGTTGT GCCTATTGCG ATTTCAATAC CTTTGCCAAT CGCGAAGATT TTATGCAGCG CTATATTGAT GCGCTGTGCC TGCATCTCAA GCGCATGGCG AGCGGCGAGA CAATCGCTGA CCCAACATGG CCGCAGGTCG CGGATGGGCC GATTCCGTGG GCAAGCTATC AATTACACGA TCTTGTTGGG CCGTTACAAC AGGCCGATTT GCCCGCGACG GTGTTTCTGG GCGGCGGCAC GCCAACTGCG CTACCATTGC ACTTACTTGA ACAGTTGATG CAAACAATCG GCCAAATTAT TCCGCTGGCG CAGGCCGAAG TAACCAGCGA AGCCAACCCA GGCACGGTGC TCGACCACAA TTATCTACGG GCAATGCGGT CGATGGGGAT TAATCGCCTA AGTATGGGCG TGCAAACCTT GCATGACCCA ACCTTGCGGG TGTTGGGGCG GATTCATACC GCCAGCGAGG CCTATGCCTC GTATCAAGCA GCTCGTAAAG TTGGCTTCGA AAATATCAAT CTTGATTTTA TGTTTGGCTT ACCAGGCCAA GATACTGCTC AATGGCGAGC GATGCTCAAC GAAATTGTAG GTTGGGATGC TGAGCATTTT GCGCTGTATT CATTAATTGT CGAGCCAAGC ACACCGCTAG CCGCCCAAGT AACTGCTGGT CGGGTCAGCA TTCCCAACGA CGATGCGACT GGCGAAATGT ATGAAGCTGC GATGGAAATT TTAGGGGCGG CTGGTTATGG CCATTATGAG ATTTCCAATT GGGCCAAAAC GCAGAACTCA GCGTTTAACC CAAGCGAGCG TTTGCCGGCC TATGCATCGC GCCACAATGT GGCCTATTGG CTCAACGCCG ATTATTTGGG AGTTGGGGCT GGTGCACATA GCCATTGGCG GGGCTGGCGC TGGGTCGATC AACGGATCTT AGAGCCTTTT GCTCAGCAGG TTGAACATGG TCAAGCACCG TTAATCGATA TCCAAGTATG CGAGCCACAA GACCGCGATT TTGAAACAAT TATGATGGGA TTGCGGCTCA ATTGTGGTTT AGGCTTTGCC CACTTTCAAC AACGGACAGG CCACGATTTG CTTGCGCAGT ATCAGCCGAT AATCGAACAA TTGGTTGGGC AAGGCTTGCT CGAACAAACC ACCAACGCCA TTCGGTTTAC CCCCCGTGGC CGAATGGTTG GCAATCAAGT GATTGAACGT TTTTTGCTTG ATTAA
|
Protein sequence | MTTHAIKHLY VHIPFCQTRC AYCDFNTFAN REDFMQRYID ALCLHLKRMA SGETIADPTW PQVADGPIPW ASYQLHDLVG PLQQADLPAT VFLGGGTPTA LPLHLLEQLM QTIGQIIPLA QAEVTSEANP GTVLDHNYLR AMRSMGINRL SMGVQTLHDP TLRVLGRIHT ASEAYASYQA ARKVGFENIN LDFMFGLPGQ DTAQWRAMLN EIVGWDAEHF ALYSLIVEPS TPLAAQVTAG RVSIPNDDAT GEMYEAAMEI LGAAGYGHYE ISNWAKTQNS AFNPSERLPA YASRHNVAYW LNADYLGVGA GAHSHWRGWR WVDQRILEPF AQQVEHGQAP LIDIQVCEPQ DRDFETIMMG LRLNCGLGFA HFQQRTGHDL LAQYQPIIEQ LVGQGLLEQT TNAIRFTPRG RMVGNQVIER FLLD
|
| |