Gene Information |
Locus tag | Haur_5257 |
Symbol | |
ID | 5737215 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | + |
Start bp | 29299 |
End bp | 30639 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641282421 |
Product | cytochrome P450 |
Protein accession | YP_001548012 |
Protein GI | 159901767 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
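The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and do not belong to any database API.

```python
# Hypothetical helpers for sanity-checking a gene record; not a database API.


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(29299, 30639)      # Start bp, End bp from the record
    print(length, protein_length_aa(length))   # prints "1341 446"
```

With the values listed for Haur_5257, the check reproduces the reported gene length (1341 bp) and protein length (446 aa).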
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAACG ATATTCGAGC ACTTCCATCA TTGCCTCAGT TTCCCTTTCT TGGGAATATT CTGTCATTTC GACAAGACAG ATTAAAGTTT CTCCGCGATC TCGGAAGCCA TGGAGATTTA GGGGTGTTCT ATCTTGGGTC ATATCCTGTT GTTTTTATTA ATTCGGCAGA ATATGCCCAT GGTATTCTTG TCCAGCATGC ACAATCGGTT GAAAAATCTT TGATGCTTCG TAAATATATG CGACCGTTGC TTGGCAATGG ATTACTCACA AGTGAAAATA GCTTCCACAG GCGGCAACGC AAACTCGTTG CACCAGCCTT TCAGCATCGC CATATTGCAA ACTACGCCAA TACCATATCA GCCTATACGG ATGAGACACA AGCCCGTTGG CATCAAGGAC AACGGATTGA TATTGCCCAA GAAATGATGC GGTTAACACT GCGTGTTATG AGCCAAACAC TCTTCTCTAC GGATATCAAT ACAGAAGCAG ATGCATTGGG GCGTGCACTT ACGACGGTTC TCAATTATTC AAATAGTGTG GCTAATACGC TTATCCATAT CCCATATCAT TGGCCTATTC CGCAGCACAA GCGGGTTCAC GCTGCAATCG CACAACTGGA TACAACGATT CAACGCCTTA TCCACGAGCG AAGAACTCAA CCTACATCAA CTAACGATTT GTTATCGGTC TTACTCCAAG CCCATGACGA CGATGATGGG TCATTTATGA CCGACACACA AGTCCGTGAT GAACTTATGA CACTTTTTTT GGCAGGCCAC GAAACAACCG CAAATGCCCT TACATGGACA TGGTATCTCC TTGCACACCA TCCTCATATC GCAACGAAGA TCAAAGATGA GGTTGATAGC ACAGTTGGCA CACGACTCCC AACCATGGAT GATTTATCAA AGCTTCCCTA TACATTACAA GTATTCAAGG AAAGTTTGCG ACTCTATCCC CCCGTTTATA TGATTGCCAG AAAAGCATCA CAAGCATTCG AGCTAGGGAG CTATCATGTC CCTGAGGGAA TGGCATTTGT CGTTAGCCCA TACACTATTC ATCGACGAGC CGACTATTTT GATCATCCTG AGGACTTTAA CCCTGATCGG TTTGATACGT CGCACGAGGC AAGCATCCCA AAAAATGCCT ATATTCCGTT CAGCTTAGGC CCACGAAACT GTATTGGGAA TCATTTTGCG ATGATGGAAG GGCATTTGAT GTTGGCAATT ATCGCTCAAC GAATGCGCTT GCTCCTTGCG CCAAACCAGC GAATCGTCCC TGATCCATCA ATCACATTAC GGCCCAAAGG GGCTATTCAT ATGATTGTTG AGCGATTTTA G
|
Protein sequence | MSNDIRALPS LPQFPFLGNI LSFRQDRLKF LRDLGSHGDL GVFYLGSYPV VFINSAEYAH GILVQHAQSV EKSLMLRKYM RPLLGNGLLT SENSFHRRQR KLVAPAFQHR HIANYANTIS AYTDETQARW HQGQRIDIAQ EMMRLTLRVM SQTLFSTDIN TEADALGRAL TTVLNYSNSV ANTLIHIPYH WPIPQHKRVH AAIAQLDTTI QRLIHERRTQ PTSTNDLLSV LLQAHDDDDG SFMTDTQVRD ELMTLFLAGH ETTANALTWT WYLLAHHPHI ATKIKDEVDS TVGTRLPTMD DLSKLPYTLQ VFKESLRLYP PVYMIARKAS QAFELGSYHV PEGMAFVVSP YTIHRRADYF DHPEDFNPDR FDTSHEASIP KNAYIPFSLG PRNCIGNHFA MMEGHLMLAI IAQRMRLLLA PNQRIVPDPS ITLRPKGAIH MIVERF
|
| |
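The GC content and protein sequence fields can, in principle, be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the space-separated 10-base blocks shown above. The function names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons; table 11 differs mainly in the set of permitted start codons, which does not matter here because the gene begins with ATG.

```python
# Sketch only: helper names are hypothetical, not from any library.

BASES = "TCAG"
# Amino acids for codons in TCAG order (standard code; same assignments as table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    prefix = "ATGAGTAACG ATATTCGAGC ACTTCCATCA"
    print(translate(prefix))  # prints "MSNDIRALPS"
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the reported GC content and protein sequence to be compared against the record.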