Gene Information |
Locus tag | Haur_4177 |
Symbol | |
ID | 5736038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5326221 |
End bp | 5327600 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281331 |
Product | cytochrome P450 |
Protein accession | YP_001546937 |
Protein GI | 159900690 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
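The coordinate and length fields above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(5326221, 5327600)
print(length)                  # 1380
print(protein_length(length))  # 459
```

Under these assumptions the results agree with the listed gene length of 1380 bp and protein length of 459 aa.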
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00497532 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTTA AGATGTTACC TGGCCCCAAA GGAACCCGCC TCGGTGGAAG TCGGCGCGAT TTACATAAAT ATGGCCCGTT GGGCTTTTTC GAATATCTAG CAAGTTTTGG CGATTTCACG ACCTGTCGCA TGGGGCCGTT TCGGGTCTAT CTGGTCAATG ATCCGGCTGG GATTCAAGAG CTTTTGGTGA CCAATCGCGA TAAGGTGCGC AAAAATGGCG GCGATCGCGA GTTGCTTTCG CGCTTTTTAG GCAATGGTTT GCTCAGCAAT GATGGCGCTG ATCATCAAAA GCAGCGCAAA TTGGTTCAGC CTGCGTTTCA TATGAAGCGC ATTCAGGCCT ACGCTGAAAC CATGGTTGAG CATACCCAAG CCATGCTCGA ACGTTGGCAC GATGGCGCGA TTCTGGATAT GGATCAGGCC ATGATGGAAT TGACCTTGAC GATTGTGACT AAAACCCTCT TCAATGCAGA CATTAGCGAA CAAGAAGTGC GCCAAGTTAG CCAAGCCATG GAAGATATTC AGGTTAACTT TACAATTATC TCGGAGCAAA GTGTACCGCT GCCGCGCTGG GTTCCAACGC GGGCTAATCG GGCGCTGGAA CATGCCAGCA AACAGATCGA TCAAGTGGTG CAGCGGGTGA TTCGCGAACG CCGTGCCAGT GGCGAGGATA CTGGCGATCT CTTGTCGATG TTATTGCTCT CAATCGATGA TGGCAATGGC CAAGGCATGA CCGACCAACA AGTGCGCGAT GAAGTAGTGA CACTGTTTTT GGCTGGTCAC GAAACCACTG CCAATACCTT AACTTGGTGC AGCTACTTGC TCAGCCAAGC GCCTGAGGTG CGCCAACGCT TGCAAGCCGA AGTTGATGAG GTGTTGCAAG GCCGCCCAGT TACTTTGCAA GATTTGCAAA AATTGCCCTA TACTGAAATG GTGATCAAAG AGACCTTGCG CATGTATCCG CCGGCTTATG CCTTGAGTGC CCGCGTGCCA ACCGAAAATA TTACGGTGCT TGGCCAAACG ATTACCCCAC GTCAGGCCGC CATGGTTTCG CCCTATGCTA TGCATCATAA TCCGCGTTAC TGGCCTGAAC CAGAGCGCTT CGACCCTGAA CGATTTAGCC CAGAGCAAGA ACGGGCACGC CATAAATATG CCTATATTCC ATTTGGGGCT GGCTCACGGG TCTGCATTGG CAACGTTTTT GCCATGATGG AAGCCCAATT ATTGTTGGCA ACCATGATGC AGCATTATGA TTTCACGCTT GATCCAACCC AACGAGTCGA GTATGATCCG CAAATTACCT TAGGGGTGAA ACATGGCTTG CGGGTACGTT TAGCTCAACG CCAACCAGTG GAGCAAAGCC TCGAATTTGC AAAAAGCTGA
Protein sequence | MTVKMLPGPK GTRLGGSRRD LHKYGPLGFF EYLASFGDFT TCRMGPFRVY LVNDPAGIQE LLVTNRDKVR KNGGDRELLS RFLGNGLLSN DGADHQKQRK LVQPAFHMKR IQAYAETMVE HTQAMLERWH DGAILDMDQA MMELTLTIVT KTLFNADISE QEVRQVSQAM EDIQVNFTII SEQSVPLPRW VPTRANRALE HASKQIDQVV QRVIRERRAS GEDTGDLLSM LLLSIDDGNG QGMTDQQVRD EVVTLFLAGH ETTANTLTWC SYLLSQAPEV RQRLQAEVDE VLQGRPVTLQ DLQKLPYTEM VIKETLRMYP PAYALSARVP TENITVLGQT ITPRQAAMVS PYAMHHNPRY WPEPERFDPE RFSPEQERAR HKYAYIPFGA GSRVCIGNVF AMMEAQLLLA TMMQHYDFTL DPTQRVEYDP QITLGVKHGL RVRLAQRQPV EQSLEFAKS
| |
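The sequence fields can be compared against the GC content and protein entries with a small sketch. It assumes the sequences are stored as space-separated blocks of 10 characters, as shown above, and that the codon-to-amino-acid assignments of translation table 11 match the standard code (table 11 differs from it only in its permitted start codons). All names below are hypothetical helpers, not part of the database.

```python
# Codon table built in the usual T, C, A, G ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
prefix = "ATGACGGTTA AGATGTTACC TGGCCCCAAA"
print(translate(prefix))  # MTVKMLPGPK
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow a comparison with the listed GC content and protein sequence.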