Gene Information |
Locus tag | Haur_2522 |
Symbol | |
ID | 5734400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3222114 |
End bp | 3225275 |
Gene Length | 3162 bp |
Protein Length | 1053 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279662 |
Product | cytochrome P450 |
Protein accession | YP_001545288 |
Protein GI | 159899041 |
COG category | [P] Inorganic ion transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0369] Sulfite reductase, alpha subunit (flavoprotein) [COG2124] Cytochrome P450 |
TIGRFAM ID | |
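
The coordinate and length fields in this table can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions are consistent with the values listed here but are not stated on the page. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 3222114
END_BP = 3225275
GENE_LENGTH_BP = 3162
PROTEIN_LENGTH_AA = 1053


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codons match protein length:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3225275 - 3222114 + 1 gives 3162 bp, and 3162 / 3 - 1 gives 1053 aa, which agrees with the listed Gene Length and Protein Length.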
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATTT CCAGCCCAAT TCGGTATATT CCCCAACCAC CAACCCGCCC AATCGTCGGC AATGTGCCCG ATATTGGCAT GGAAACGCCT GTCCAGAATT TGATGAAATT AGCTCAGCAT TATGGGCCAA TTTTCCGCGT GAGTTTTCCT AATCGCAGCG TGTTGGTCGT TTCTTCGGCT GAACTTGTGG CCGAAATTAG CGATCAACAA CGTTTCGATA AATTATTGCA TGGGCCATTA ATTCAAATTC GCGATTTTGC AGGCGATGGC TTGTTTACGG CCTATACCGA GGAAGCCAAT TGGAGCAAAG CCCATCGTTT ATTGATGCCA GCCTTCGGGC CAGCCAGTAT GCGTAATTAT TTCGACGACA TGCTCGATAT TGCCGACCAA TTATTTACTA AATGGGAGCG TCAAGGGCCA GAAACCGATT TTGATGTGGC CGATAATATG ACGCGCCTAA CGCTCGATAC GATTGCCTTA TGTGGTTTTG GCTATCGGTT TAATTCGTTT TATCAACGCG AAATGCACCC ATTTGTTGAA GCAATGGTGC GAGCCTTAGC CGAGGCTGGG GCACGCGCCC GCCGTTTATC AATTCAAACG AAATTAATGC GCTCAACCCA GCGTCAATAT GAAGCTGATA TGCAGTATAT GCACGGCATC ACCGATGAAT TAATTGCTAA ACGGCGCAGT TTGCCAAGCA ACGAAGTTCC CAACGATCTG CTAGGATTAA TGCTCAATGC CAAAGATTCG ATCACCGGTG AAGGCTTAGA TGATGCCAAT ATTCGCAATC AACTGGTGAC ATTTTTGATT GCTGGCCACG AAACCACCAG CGGCCTGCTC TCGTTTGCAA CCTACTTTTT GCTCCAACAG CCTGAAATTT TGCAACGCGC TCAAGCCATC GTCGATCAAG TGCTCGGCGA TCGGCTGCCA CGCTACGAAG ACTTGGCCAA ACTGGGCTAC CTCGACCAAA TTTTGCGCGA AACCTTGCGG CTCTGGCCAA CCGCGCCTGT TTTTGGGGTT TATGCCAAGC ACGATACTAA CATTGGTGGC TTTCCGATTA AGCAGGGCGA AAAATTCATA GCCTTATTGC CAACTTTGCA CCGCGATCCC AAAGTTTGGC TCAACCCCAA CCAATTTGAT CCCGATCGCT TTGCGCCTGA AGTGAGGGAA CAAATCCCTG AGCACGCTTG GAAGCCATTT GGCAATGGCC AACGCGCCTG TATTGGGCGT TCATTTGCCA TGCAAGAGGC CAGCTTGGTT TTAGCAATGA TGCTGCAACG TTTTGAATTA TCGCAACCGC AACCCTACCA GTTGCATGTC AAAGAAACCC TAACGCTCAA ACCTGAAGGC TTGACCGTTC GAGCACGGGT ACGCAAAAAC ATCGTGCGCA GCACCAAGCC AACTCAGCCA AATGTAGCAA TTCAATCAAA CCCAAATCAA GCCCAACACA ATATCCCATT GCTGGTGCTG TATGGCTCTA ATTCTGGCTC ATCTGAAGCC TTCGCTCGCC GAATTGCCAG CGATGGTGAG GCACGCGGTT ACCAAACAAG CGTGGCTGCG CTCAATAATT ATGTCAATAA ATTACCAACC ACCGGAGCAG TGAGCATCGT GGCGGCCTCA TACAACGGCC AGCCTGCCGA TAATGCCCAA GCCTTCTGTC AATGGTTAGC TGGCGTTGAG CCAAACTCGC TCAAAGGCGT GCGCTATAGC GTTTTTGGCT GTGGCAACCG CGATTGGCAG AGCACCTACC AAGCTGTGCC GACTCAAATT GATCAACACT TGCAGGCCGC CGGAGCCGAA 
CGTTTGCTTC AACGCGGTGC AGCCGATGCC CGCAGCGATT TCTTTGGTGA TTTTGAGCGT TGGTATGCGC CGTTTTGGCA AACCCACAAC CAAACATTTG CAATCGCAAG CGCCGAAATT AACAGCAAAC CACTGTACAA GGTCGAATTA CTGCCATCAA GCAGCGATCA GTTGGCCCAA CAAACGGGCT TTATGTTTGC TAGCGTGCTC GAAAATCGCG AATTAGTTGA TCTCAGCTCG CCTTTGGGTC GTTCAAAACG CCACATCGAA TTACGTTTGC CAAACGAACT GCAATACCAA GCTGGTGATT ATTTAGCGAT CTTGCCGCAA AATCATCCTA GCCTAATCGA GCGGGCTTGC AAACATTTTG GGCTAAAACC TGAACAAACT ATAATTTTGC ATGCTACACG CGGGGCTGCC AACCTGCCAA TTGATCGCCC GATTAGCCTA GGTGAATTGC TGAGCAGCCA CGTTGAATTA GCAACTCCCG CCACGCAGCG CGATTTGGAG TTGTTGGCGC AGAAGAATGT TTGTCCGTCA CACCAAATTC ATTTAGCTGC ACTGGCCGCA GATCACGAAC GCTATACCAC CGAGATTTTG CAAAAACGCC TGAGCTTGTT GGATATGCTT GAGCAATATC CATCCTCAGT GCTTGATTTT GGCGAATTTT TAGAGCTATT GCCAGCAATG CGAGTGCGCC AATATTCAAT TTCATCGTCA TCATTAGTCA ATCCAAACCA AGCCAGCCTA ACCGTGGCGG TGGTTGATGC CCCAGCATGG TCGGGTAAGG GCCAGTTCTA TGGCACGGGT TCGAGCTATT TGGCCCGTTT GCAAGTTGGC GATCAGATTG CGGTGAGCTT GCGTCAACCA CATATTCCGT TTCGCCCACC GAGCGCCAAC AGCACACCAT TACTGATGAT TTGTGCAGGC ACTGGTTTAG CGCCATTCCG TGGTTTTATC CAAGAGCGCG TCGCTCGCCA AGGCCAAGGC GAAGCACTTG GCCCGAATGC CCTGTTTTTT GGCTGCGACC ATCCTGAGGT TGATCTGCTT TATCACGAAC AGATTCAAGC TTGGCAAAAA GCTGGAGTGC TAGAATTTTT CCCAGCATTC TATCGCCAGC CAGTTGGTGA AGTCAGCTTT GTGCAACATC GGCTCTGGCA AGAACGCCAG TATGTGTGGA GCTTAATCGA ACAAGGTGCA GTAATAGCCG TTTGTGGCGA CGGTCGCTCC ATGGCTCCAG CTGTGCGTGA AACCTTGGCG CGAATCTATG CCGAAGCAAC TGGCAGCGAG CAAACAGCAG GCATGGCATG GATTGCCGAA ATCGAGCAAG CAGGACGCTA TGTCGCCGAT GTTTTCGGCT AA
|
Protein sequence | MSISSPIRYI PQPPTRPIVG NVPDIGMETP VQNLMKLAQH YGPIFRVSFP NRSVLVVSSA ELVAEISDQQ RFDKLLHGPL IQIRDFAGDG LFTAYTEEAN WSKAHRLLMP AFGPASMRNY FDDMLDIADQ LFTKWERQGP ETDFDVADNM TRLTLDTIAL CGFGYRFNSF YQREMHPFVE AMVRALAEAG ARARRLSIQT KLMRSTQRQY EADMQYMHGI TDELIAKRRS LPSNEVPNDL LGLMLNAKDS ITGEGLDDAN IRNQLVTFLI AGHETTSGLL SFATYFLLQQ PEILQRAQAI VDQVLGDRLP RYEDLAKLGY LDQILRETLR LWPTAPVFGV YAKHDTNIGG FPIKQGEKFI ALLPTLHRDP KVWLNPNQFD PDRFAPEVRE QIPEHAWKPF GNGQRACIGR SFAMQEASLV LAMMLQRFEL SQPQPYQLHV KETLTLKPEG LTVRARVRKN IVRSTKPTQP NVAIQSNPNQ AQHNIPLLVL YGSNSGSSEA FARRIASDGE ARGYQTSVAA LNNYVNKLPT TGAVSIVAAS YNGQPADNAQ AFCQWLAGVE PNSLKGVRYS VFGCGNRDWQ STYQAVPTQI DQHLQAAGAE RLLQRGAADA RSDFFGDFER WYAPFWQTHN QTFAIASAEI NSKPLYKVEL LPSSSDQLAQ QTGFMFASVL ENRELVDLSS PLGRSKRHIE LRLPNELQYQ AGDYLAILPQ NHPSLIERAC KHFGLKPEQT IILHATRGAA NLPIDRPISL GELLSSHVEL ATPATQRDLE LLAQKNVCPS HQIHLAALAA DHERYTTEIL QKRLSLLDML EQYPSSVLDF GEFLELLPAM RVRQYSISSS SLVNPNQASL TVAVVDAPAW SGKGQFYGTG SSYLARLQVG DQIAVSLRQP HIPFRPPSAN STPLLMICAG TGLAPFRGFI QERVARQGQG EALGPNALFF GCDHPEVDLL YHEQIQAWQK AGVLEFFPAF YRQPVGEVSF VQHRLWQERQ YVWSLIEQGA VIAVCGDGRS MAPAVRETLA RIYAEATGSE QTAGMAWIAE IEQAGRYVAD VFG
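
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before use in downstream tools. The following is a minimal sketch of one way to do that and to recompute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short input in the usage lines is a toy string, not the gene sequence from this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    toy = "ATGC GGCC\n"          # toy example only
    cleaned = clean_sequence(toy)
    print(cleaned, len(cleaned), gc_fraction(cleaned))
```

Pasting the full gene sequence into `clean_sequence` should yield a string whose length can be compared with the Gene Length field, and `gc_fraction` can be compared with the reported GC content, allowing for rounding on the page.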