Gene Haur_3072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3072 
Symbol 
ID5734944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3879388 
End bp3880758 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content52% 
IMG OID641280216 
Productcytochrome P450 
Protein accessionYP_001545838 
Protein GI159899591 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.863931 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGTTC AACAGATGCT CTGGAAGAGT GGCCCAACCG ACGCGCCATT GCCCCCGGTT 
GCTGATGGCT CGTTTTTGGT GGGTAGTTTA CAGGCAATGC TCAGCGATCC CATTGATTTT
TTCGTTAAGC AATATCAAAA ATTTGGCCCG ATTTTTCGCG TCAAAGCCCT CAATAATAAA
TTTACTATCT TGGCTGGGCC TGAGGCCTGC CTCTTTTTAG CCCGTGAAGG CACTAAGCAC
TTTAGCTCGT GGGAAACATG GCACTCAATG GATGCTGAAA TGGGTGCTTC GAAATCGCTG
ATTAGCGTCG ATGGCGAGCA ACATTCGCGA CTGCGAGCCT TGCAAAAACG CGGTTATAGC
CGCCAAACGA TTGAAACGCA ATTTCCCGAA GTGCTCAAGG TCGTTCATGG CTTCTTGGAT
CAATGGCAGG TTGGCACATC GAAAGCTACG GTAACTCAAC TCCAACGGTT GATTACCGAT
GAATTGGGGA TGTTGATTGC GGGCCAAGGC CCAGGTGATT ATATTGATGA TGTGCGGACA
TTTGTTCGAA TTGCGCTCAT GACCCACATC ACGCGCCAAC GCCCAAGCAT TTTGCGCATG
TTGCCTGAAT ATCGCCGTGC CCGCGACCGC AGCTTGGAAT TAAGTAAGCA GGTGCTGAAA
GCTCATCGAA GCGGCACACG CGATGCCAAC CGCCCGCCCA GCCTGATCGA CGATATTATC
GCCGCGACCA ATGACCCGAG CTTAATGCCT GAGGGCGATT TGGTTATGAC TGCCTTGGGG
CCATACATCG CAGGCTTGGA TACGGTTGCC AACACCATGG CGTTCTTGTT GTATGTGCTG
ACCACCAAGC CCGAGCTATA CGAACAAGTT GAGGCCGAAG CCGATGCACT GTTTGCCAAT
GGTGTGCCCG ATCCAGCCGA TTTGCGCAAA ATGGAAGTGT TGCATCGCGT AGTGCTCGAA
AACTTCCGCA TGTATCCAAT CGCGCCAGCC GTACCGCGCA CCGTCAAAGC ACCATTTGAG
TTTGGTGGCT ATCGCGTCGA TGCAGGCACA CGAACCTTAG TAGCAACTAC GGTCGGCCAT
TTCTTGCCCG AACTCCACCC TGAGCCAGAA AAATTCGATA TTGATCGCTA TTTGGCTCCG
CGTAACGAGC ACCGCATTCC TGGGGCATTT GCACCATTTA GCACTGGCTC GCACACCTGT
TTGGGCGCTG GCCTAGCCGA AGTCCAAATT ATGCTGACCA CTGCTGCCCT GTTGCACTAT
GCCAAGTTCG AAGCTGATCC AATCGATTAT AAGCTGAAGA AAGTTTTTGC CCCAACCCCA
GCCCCCGACA GTAGCTTCAA ATTGCGCTTG GTCACACGCC GTAACAGCTA A
 
Protein sequence
MTVQQMLWKS GPTDAPLPPV ADGSFLVGSL QAMLSDPIDF FVKQYQKFGP IFRVKALNNK 
FTILAGPEAC LFLAREGTKH FSSWETWHSM DAEMGASKSL ISVDGEQHSR LRALQKRGYS
RQTIETQFPE VLKVVHGFLD QWQVGTSKAT VTQLQRLITD ELGMLIAGQG PGDYIDDVRT
FVRIALMTHI TRQRPSILRM LPEYRRARDR SLELSKQVLK AHRSGTRDAN RPPSLIDDII
AATNDPSLMP EGDLVMTALG PYIAGLDTVA NTMAFLLYVL TTKPELYEQV EAEADALFAN
GVPDPADLRK MEVLHRVVLE NFRMYPIAPA VPRTVKAPFE FGGYRVDAGT RTLVATTVGH
FLPELHPEPE KFDIDRYLAP RNEHRIPGAF APFSTGSHTC LGAGLAEVQI MLTTAALLHY
AKFEADPIDY KLKKVFAPTP APDSSFKLRL VTRRNS