Gene Haur_5257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5257 
Symbol 
ID5737215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp29299 
End bp30639 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content45% 
IMG OID641282421 
Productcytochrome P450 
Protein accessionYP_001548012 
Protein GI159901767 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAACG ATATTCGAGC ACTTCCATCA TTGCCTCAGT TTCCCTTTCT TGGGAATATT 
CTGTCATTTC GACAAGACAG ATTAAAGTTT CTCCGCGATC TCGGAAGCCA TGGAGATTTA
GGGGTGTTCT ATCTTGGGTC ATATCCTGTT GTTTTTATTA ATTCGGCAGA ATATGCCCAT
GGTATTCTTG TCCAGCATGC ACAATCGGTT GAAAAATCTT TGATGCTTCG TAAATATATG
CGACCGTTGC TTGGCAATGG ATTACTCACA AGTGAAAATA GCTTCCACAG GCGGCAACGC
AAACTCGTTG CACCAGCCTT TCAGCATCGC CATATTGCAA ACTACGCCAA TACCATATCA
GCCTATACGG ATGAGACACA AGCCCGTTGG CATCAAGGAC AACGGATTGA TATTGCCCAA
GAAATGATGC GGTTAACACT GCGTGTTATG AGCCAAACAC TCTTCTCTAC GGATATCAAT
ACAGAAGCAG ATGCATTGGG GCGTGCACTT ACGACGGTTC TCAATTATTC AAATAGTGTG
GCTAATACGC TTATCCATAT CCCATATCAT TGGCCTATTC CGCAGCACAA GCGGGTTCAC
GCTGCAATCG CACAACTGGA TACAACGATT CAACGCCTTA TCCACGAGCG AAGAACTCAA
CCTACATCAA CTAACGATTT GTTATCGGTC TTACTCCAAG CCCATGACGA CGATGATGGG
TCATTTATGA CCGACACACA AGTCCGTGAT GAACTTATGA CACTTTTTTT GGCAGGCCAC
GAAACAACCG CAAATGCCCT TACATGGACA TGGTATCTCC TTGCACACCA TCCTCATATC
GCAACGAAGA TCAAAGATGA GGTTGATAGC ACAGTTGGCA CACGACTCCC AACCATGGAT
GATTTATCAA AGCTTCCCTA TACATTACAA GTATTCAAGG AAAGTTTGCG ACTCTATCCC
CCCGTTTATA TGATTGCCAG AAAAGCATCA CAAGCATTCG AGCTAGGGAG CTATCATGTC
CCTGAGGGAA TGGCATTTGT CGTTAGCCCA TACACTATTC ATCGACGAGC CGACTATTTT
GATCATCCTG AGGACTTTAA CCCTGATCGG TTTGATACGT CGCACGAGGC AAGCATCCCA
AAAAATGCCT ATATTCCGTT CAGCTTAGGC CCACGAAACT GTATTGGGAA TCATTTTGCG
ATGATGGAAG GGCATTTGAT GTTGGCAATT ATCGCTCAAC GAATGCGCTT GCTCCTTGCG
CCAAACCAGC GAATCGTCCC TGATCCATCA ATCACATTAC GGCCCAAAGG GGCTATTCAT
ATGATTGTTG AGCGATTTTA G
 
Protein sequence
MSNDIRALPS LPQFPFLGNI LSFRQDRLKF LRDLGSHGDL GVFYLGSYPV VFINSAEYAH 
GILVQHAQSV EKSLMLRKYM RPLLGNGLLT SENSFHRRQR KLVAPAFQHR HIANYANTIS
AYTDETQARW HQGQRIDIAQ EMMRLTLRVM SQTLFSTDIN TEADALGRAL TTVLNYSNSV
ANTLIHIPYH WPIPQHKRVH AAIAQLDTTI QRLIHERRTQ PTSTNDLLSV LLQAHDDDDG
SFMTDTQVRD ELMTLFLAGH ETTANALTWT WYLLAHHPHI ATKIKDEVDS TVGTRLPTMD
DLSKLPYTLQ VFKESLRLYP PVYMIARKAS QAFELGSYHV PEGMAFVVSP YTIHRRADYF
DHPEDFNPDR FDTSHEASIP KNAYIPFSLG PRNCIGNHFA MMEGHLMLAI IAQRMRLLLA
PNQRIVPDPS ITLRPKGAIH MIVERF