Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4659 |
Symbol | |
ID | 5736506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5955196 |
End bp | 5956626 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281823 |
Product | protoporphyrinogen oxidase |
Protein accession | YP_001547418 |
Protein GI | 159901171 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1232] Protoporphyrinogen oxidase |
TIGRFAM ID | [TIGR00562] protoporphyrinogen oxidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000642175 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGTTG CTGATCAGCC ACGCATTGCC ATTATTGGCG GTGGTATCGC TGGCCTTAGT ACAGCATGGT ATTTACAACA ACAAGGTCTC ACGAATATTC AGCTTTTTGA ACGTGATCAA CGGCTGGGCG GCAAATTGCG CACCAGCCAT GTTGCCTTGC CCGATGGCGC TGGCGAATTG CTGGTCGAAG CTGGCCCCGA TGCCTTTATC AGCCAAAAGC CTTGGGGCTT GCAATTGGCG CGTGAGTTGG GCTTGGAAGA TCAGTTAATT TCAACTGAGC CAGCTCGCCA TAAGGTGTTT GTGTTGCATC GTGGCAAGCC CGAACCCTTG CCTGATGGCA TTAACTTGGT TGTCCCAACT GAGTTTTGGC CGTTGCTGCG CACGCCGATT CTCTCGCTGC CAGGCAAATT GCGTATGTTG CTCGATTTGG TCTTGCCTGC CCGCCAAAGT AATACTGATG AATCGCTGGC CGATTTTGTG CGCCGCCGAT TTGGGGCCGA AGCGCTGGAT AAATTGGCCG AGCCGTTGAT GGCAGGCATT CACAATGCAG AATCGGATCG CCAAAGCCTC GAAGCCACCT TTCCACGCTT TATCGAGGCC GAACGCAGCC ATGGCAGTGT GATTCGTGGG ATTCTGGCGG CTAAACTTAA AGCAGGCAAG CCCAAAGGTC AGCAACTTAG CCCATTTATT AGCTTACGCG GCGGGATCGA GCAATTAATT ACCGCGCTGG TTGAGCAGCT CAACGTTGAA ATTCGGACAA ATTGTGGGGT TCAAGCGCTG CGCTACGACC CAACCAACGC CTCAGCCTAT CAACTGACCC TCGATGATGG CACGAAGATT GATGCTGATG CAGTGGTGTT GGCAGTGCCT AGTTTTGTGG CCGCTGAGTT GGTCGCACCT TGGGCTGAAG CCTTGGCCGA GCGCTTGAAG GCGATTCGTT ATGTCAGCAC TGGCACAGTT TCGTTGGCAT TTCGGCGTAG CGAAACCAAC ATGGCCTTCG ATAGTTATGG CTTGGTGATT CCGCGCAGCG AATATCGGCT GATTAATGCT GTAACGATCA ACTCACGCAA ATTTGCTGGG CGTGCTCCCG CCGATTATAT GCTGTTGCGG GCCTTTGTGG GCGGCTCGAA ACATCCCGAA GTGCTGCGCT TGGATGATCA GGCATTAACT CAATTGGTGC GTGATCAGCT TAAATCGATT TTTGGCCTGA CCGCCGAGCC AATTTGGAGC GGGGTTGCCC GTTGGAACGA GGCTAATCCC CAATACGATG TTGGTCATTT CCAACGTATG GATCAGCTTG AGGCCTTGTG TCCAGAAGGC TTGTTGTTGT GTGGCAGCGG CTTTCGGGGC GTGGGCATTC CCGATTGTGT GCGCCAAGGC CAAGCAACCG CTCAGGCCAT TAGCCAATTG TTCGCTTTGG CTAACGCTTA A
|
Protein sequence | MDVADQPRIA IIGGGIAGLS TAWYLQQQGL TNIQLFERDQ RLGGKLRTSH VALPDGAGEL LVEAGPDAFI SQKPWGLQLA RELGLEDQLI STEPARHKVF VLHRGKPEPL PDGINLVVPT EFWPLLRTPI LSLPGKLRML LDLVLPARQS NTDESLADFV RRRFGAEALD KLAEPLMAGI HNAESDRQSL EATFPRFIEA ERSHGSVIRG ILAAKLKAGK PKGQQLSPFI SLRGGIEQLI TALVEQLNVE IRTNCGVQAL RYDPTNASAY QLTLDDGTKI DADAVVLAVP SFVAAELVAP WAEALAERLK AIRYVSTGTV SLAFRRSETN MAFDSYGLVI PRSEYRLINA VTINSRKFAG RAPADYMLLR AFVGGSKHPE VLRLDDQALT QLVRDQLKSI FGLTAEPIWS GVARWNEANP QYDVGHFQRM DQLEALCPEG LLLCGSGFRG VGIPDCVRQG QATAQAISQL FALANA
|
| |