Gene Haur_2227 details


Gene Information

Locus tag: Haur_2227
Symbol:
ID: 5734114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2830898
End bp: 2832262
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 52%
IMG OID: 641279368
Product: cytochrome P450
Protein accession: YP_001544995
Protein GI: 159898748
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
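
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not say so, but the listed gene length fits that reading) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2830898
END_BP = 2832262


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.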


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCTTG CTGTTCGTTC GCTGCCGCAT CGGCGCAGCC GTTTTGCGCT CGATATTGCG 
CTCGAAGTTA AACGCCAAGG CACATTACAA TTTTTTGAAT CGACGTGGCG ACGCTATGGC
GACCTTGCCC ACCTGCAACT TGGCTCAAAA GATATGTTTT TGGTGGTGCA TCCTGATCAC
GTTCGGCGGG TGATGGTCGA GCAGCGTGAC ACTTATTCAA AAAAGGCTAG CTATGAAGGT
GTGCGCAAAT TATTGTTAGG CGATGGCTTG GTAGCCAGCA CTGGCAGCCT TTGGCGGCGG
CAGCGCAAAT TGATGGCTCC ATTTTTCACG CCACGGGCAA TTGAAACCTA TTTACCAATT
ATTGTTGAGG ATGGGGCATG GTTTCGCGAA CGTTGGAGCG CTGCTGCCAA ACAGGGCGAA
CCACTCGATA TTCTGACCGA AATGTCGGTG CTCACAGCGT CGATCATCTT AAAAAGTATG
TTTAGCCTTG AGGCCGATGA CACGATTGCT TGGGTCAAAC ATGCGGTCGA AACGATGATT
GGCTTTGCCT CAAGCCGCCA GATGAACCCA CTGCATGCGC CACTCTGGAT GCCAACGCCC
AAAAATCGGG CTTATTTGGA AGCTCGCAAC CGGGTTAATC AGTATATTCA GGGCATTATC
GCCGAGCGCC AACGCCAAGC GCCGGATGAA TGGCCTAACG ATTTGCTAAC CCGCATGATG
CAGGCTCGCG ACGAAGAAAC AGGCGAACCA ATGTCTACAG TTTTGTTGCG TGATGAGGCA
ATTACGGTCT TTTTTGCTGG CCACGAAACC ACCGCCCGCA CGCTATCATT CTTGTGGTAT
GCCTTGGCCA ACAATCCCGA CGTTGCCGAA CGCATGCAGG CCGAAATCGA TAGTGTGTTA
GGCGATGCTG CACCAACTCT CGATCATCTC AAACAACTGC CTTATACCTT GCAAGTAATC
AAAGAAACCT TGCGACTTTA CCCCGCCGCC CCGATGTATG CCCGTGATGC GGTTGCTAGT
GATGAATTTG CAGGCATTAA GGTTCCAGTT GGCTCACGGA TGACGATCAT GCCCTATCTT
ACCCATCGCC ACCCTGATTT TTGGGATAAG CCGTTGCGCT TTGATCCTGA TCGTTGGTTG
CCAGAACGCG AAGCCCTGCG CCACCCCTTT GCCTATCACC CATTTGCGGC TGGGCAGCGG
ATTTGCATCG GCAATAATTT TTCGTTGTTT GAATCGCATG TGGTGGTTGC GATGCTGGCC
CGCCATTTTG CCCTGCGCAC CGTGCCAAAT CACACGCCGC AAGTCTGGAT GGATGGCACA
CTTGGCTCGC GCAATGGCTT GCCAATGTTC ATCACCCAGC GCTAA
 
Protein sequence
MTLAVRSLPH RRSRFALDIA LEVKRQGTLQ FFESTWRRYG DLAHLQLGSK DMFLVVHPDH 
VRRVMVEQRD TYSKKASYEG VRKLLLGDGL VASTGSLWRR QRKLMAPFFT PRAIETYLPI
IVEDGAWFRE RWSAAAKQGE PLDILTEMSV LTASIILKSM FSLEADDTIA WVKHAVETMI
GFASSRQMNP LHAPLWMPTP KNRAYLEARN RVNQYIQGII AERQRQAPDE WPNDLLTRMM
QARDEETGEP MSTVLLRDEA ITVFFAGHET TARTLSFLWY ALANNPDVAE RMQAEIDSVL
GDAAPTLDHL KQLPYTLQVI KETLRLYPAA PMYARDAVAS DEFAGIKVPV GSRMTIMPYL
THRHPDFWDK PLRFDPDRWL PEREALRHPF AYHPFAAGQR ICIGNNFSLF ESHVVVAMLA
RHFALRTVPN HTPQVWMDGT LGSRNGLPMF ITQR
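
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch of that relationship, the code below builds a codon table in the conventional TCAG ordering and translates the first and last in-frame stretches of the gene sequence shown above. Table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, not taken from any library.

```python
from itertools import product

# Codon table in TCAG order; the amino acid assignments are shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence above.
first_block = translate("ATGACGCTTG CTGTTCGTTC GCTGCCGCAT")
last_block = translate("ATCACCCAGC GCTAA")
print(first_block, last_block)
```

The two results correspond to the first ten and last four residues of the protein sequence (MTLAVRSLPH and ITQR). Passing the full gene sequence to `translate` should reproduce the whole protein in the same way.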