Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2891 |
Symbol | |
ID | 5734762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3661984 |
End bp | 3663357 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280034 |
Product | cytochrome P450 |
Protein accession | YP_001545657 |
Protein GI | 159899410 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00974198 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGACTG AAACCGCCTT GGCAAGCCCA GCCTTGATTG CGCCAGGCCC GTGTGGCCGC CTGTTGATTG GCAATTTGCA CGATTTTATC GATGATTTTC TGGTCACGAT GCAACGTGAT TTTGTTAATC ATGGCGATAT TGTGCGCTAC CAAATTGGCT CACGCATTGT GCATGTGGTT TCAAATCCCG ATTATGCGCA ATATGTACTT GTTGAGCATC AGCGGGATTT CCCCAAAGTT GGCGGCAACG GCGGTTTACA AATTATCGCT GGCAATGGCT TGATTTCTAA TCCCAGCCCT GAATCGTGGC TCATTCAGCG GCGTATGATG CAGCCAATGT TTCACCGCAA ACGCCTCGCG GCCATGGGTG AAAAAATCGA TGGCGCAGGC GCACGTATGA TCCAGCGTTG GCAAGCGTTG CCCGATGCAG CACCGATCGA CATGGATCAT GAAATGCTGC AAGTGACGCT TGATATTATT ATGCAAACCA TGTTTAGCGC CGATATGCTG GGCGAAGTGG GCAAATTGGC TCCGGCAGTC ACTGCGGCGG TCGATTATGC AAATTATCGC ATTTTTAATC CGTTCAGCCT GCCCTTACCA ATGCCAACCC GCCGCAATCG CGCCTATATG CAAGCTCGCA AAGTGCTGGA TAGCATGATT TTTGGTTTGA TCAAACAACG ACGGGCGGCC ACCGAGCCAG TTGGCGATTT GCTGGATATG TTGCTCGAAG CCCAAGATGC CGAGACTGGC GAGCGCATGA GCGATGAGCA GATTCGCGAT GAAGTCCTGA CAATCTTTGC GGCTGGTCAC GAAACCACGG CCAATACCTT GACTTTTGGC TGGTATTTGC TCAGCGAACA CTGCGAAATT CGCCAAAATC TCCAAACCGA GCTTGATCAG GTGTTGCAAG GCCGAGCACC AAGCGTCAAC GATTTGCCGC AATTGCCCTA CACGTTGCAA GTATTTAAAG AGGCCATGCG CTTGTACCCA GCTGCGCCAA TTACTGGACC TCGCCGTGTT ACCAAACCCA CCCAACTTGG TGGCTACGAC TTGCCACTCA ATTCGCAAGT GATCGTCAGC ATCACCAATT TGCATTTGCA TCCGGCTTTT TGGGAAAATC CTTTGCAATT TGATCCGAGC CGTTTCGCGC CAAATGCCAA TCAGCCTCGT CACCATTTGG CGTTTATGCC GTTTGGCGCG GGGCCGCGTA AATGCATTGG CAATAATTTG GCCGAGATGG AAGGCGCATT ATTACTAGCA TGTGTGGCGC AGCATTACAA CCCGCAATTG CAGCCAGGCC ACCAAGTTAA GCCTGAAATG GCGATTACCA TGCGAGCCAA AGCTGGCATG CCGATGTTGC TCAAGCGGCG CTAA
|
Protein sequence | MMTETALASP ALIAPGPCGR LLIGNLHDFI DDFLVTMQRD FVNHGDIVRY QIGSRIVHVV SNPDYAQYVL VEHQRDFPKV GGNGGLQIIA GNGLISNPSP ESWLIQRRMM QPMFHRKRLA AMGEKIDGAG ARMIQRWQAL PDAAPIDMDH EMLQVTLDII MQTMFSADML GEVGKLAPAV TAAVDYANYR IFNPFSLPLP MPTRRNRAYM QARKVLDSMI FGLIKQRRAA TEPVGDLLDM LLEAQDAETG ERMSDEQIRD EVLTIFAAGH ETTANTLTFG WYLLSEHCEI RQNLQTELDQ VLQGRAPSVN DLPQLPYTLQ VFKEAMRLYP AAPITGPRRV TKPTQLGGYD LPLNSQVIVS ITNLHLHPAF WENPLQFDPS RFAPNANQPR HHLAFMPFGA GPRKCIGNNL AEMEGALLLA CVAQHYNPQL QPGHQVKPEM AITMRAKAGM PMLLKRR
|
| |