Gene Haur_1966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1966 
Symbol 
ID5733855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2402538 
End bp2403773 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content50% 
IMG OID641279110 
Productcytochrome P450 
Protein accessionYP_001544737 
Protein GI159898490 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.940612 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGTG TATCGCGTGA AACCAAGCCC ACTCAACCAG TCTTGAATTT GACTGATCGC 
CAATTCAAGG CTGATCCGTT TAGTATTTAT GCTCAGTTAC GCCGTGATAA TCCAGTTGCT
AAAGCTAAAC TTGGCTCGCT GCCGACATGG ATTGTTACCC GCTACGATGA TGTCGTTGAA
ATTTTGAAAA ACGATCGGGT CTTTGTCAAA AACTATAAGA ATGCCCAATC GTTGGAGCAA
CAACGCAAAC GCCCATGGAT GCCCGCATCG CTTCGAGCGC TCGAAAGCAA TATGCTCGAT
CAAGATAACC CTGACCATCT GCGCTTACGT TCATTGGTCC ACAAAGCTTT TACCCCTCAG
CGCATGGAAG AAATGCGGCC ACGCATTCAG TCAATTGCTG AATCATTATT AATCAGCAGC
CAACAACGTG GTCGCGGCGA TTTGATTGCC GATTTTGCCT TGCCCCTGCC TTTGACCGTG
ATCGTCGAAT TGCTTGGCAT TCCAACTGAG GATCGCCAAA AATTTCATCG TTGGGCCAAA
TATGTACTGA ACAGCCCAAC CATGCTGAAT ATGCTATTGG CAATTCCGGC AATTATGGCT
CAGATGAAAT ATCTCAAACA ACTTTTTGCC AAGCGGCGCA GCAACCCCCA AGACGATTTG
CTGACGGCGT TGGTGCAAGC TGAGGCCGAT GGTGATCGTT TCAGTGAAGA TGAATTAGTC
GCCATGGTCT TTTTGCTGAT GCTCGCAGGC CACGAAACCA CGGTCAATCT GATTTCATCG
GGAACCCTAG CGTTATTGCA GCATCCTGAG CAGTTGGCCT TGTTGCGACG TTCGCCAGAG
TTGATCAAAT CGGCAGTTGA GGAATTGGTG CGGTTTACTG CGCCAGTTGA AACCGCGACC
GAGCGCTACG CTGCTGAAGA TGTGATTATT GCCGACACCA AGATTGCCAA AGGCGAAATG
GTTTTGGTCG CGCTGGCCTC AGCCAATCGG GATGAACGCC AATTTACTAA CCCTGATCAA
CTCGACATAA CCCGAGAAAA GAATCGCCAT GTTGGTTTTG GTTTGGGCAT TCACTATTGC
TTAGGTGCGC CTTTAGCCCG CATGGAAGCC CAAATTGGCC TGCAATTATT GACTGATCTG
CGCCCAAACT TGCGCTTAGC CGTGCCCGCT GAGCAGTTGC GTTGGCGTTC GACCGCAGTT
GTGCGCGGCC TCGAAGCCTT GCCTGTGGAG TGGTAA
 
Protein sequence
MSSVSRETKP TQPVLNLTDR QFKADPFSIY AQLRRDNPVA KAKLGSLPTW IVTRYDDVVE 
ILKNDRVFVK NYKNAQSLEQ QRKRPWMPAS LRALESNMLD QDNPDHLRLR SLVHKAFTPQ
RMEEMRPRIQ SIAESLLISS QQRGRGDLIA DFALPLPLTV IVELLGIPTE DRQKFHRWAK
YVLNSPTMLN MLLAIPAIMA QMKYLKQLFA KRRSNPQDDL LTALVQAEAD GDRFSEDELV
AMVFLLMLAG HETTVNLISS GTLALLQHPE QLALLRRSPE LIKSAVEELV RFTAPVETAT
ERYAAEDVII ADTKIAKGEM VLVALASANR DERQFTNPDQ LDITREKNRH VGFGLGIHYC
LGAPLARMEA QIGLQLLTDL RPNLRLAVPA EQLRWRSTAV VRGLEALPVE W