Gene Haur_4119 details


Gene Information

Locus tag: Haur_4119
Symbol:
ID: 5735980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5268004
End bp: 5269293
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 51%
IMG OID: 641281273
Product: cytochrome P450
Protein accession: YP_001546879
Protein GI: 159900632
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
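
The coordinate and length fields in this record can be cross-checked against each other. The short Python sketch below is an illustration only, not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are chosen here for illustration.

```python
# Values copied from the Gene Information record.
start_bp = 5268004
end_bp = 5269293

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")      # compare with the Gene Length field
print(protein_length_aa, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values agree with the 1290 bp and 429 aa reported in the record.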


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAACGGC GTTTAGCTCC TCAGGCTGGC TGTCCGCCTG GCTCGTTTGG TGCGCCCGTG 
ATTGGCGAGA TTCGCGAGTG GGCCGCTGAC CCATTGCAAT TTGCCCAAGC ACGCGCCCAA
CGCTATGGCC CGATTTGGTC GACCCATCTT TTGGGTCGGC CTTGTGTCGT TTTATTGGAG
CCAGCAGGCA ATCGCTTTAT GCTGAGCCAA GGTTTACAGT ATTTTTCGTG GCGAGCTGGC
TGGGGCCGGG CTATGTTGCG GCTCATGGGT GGTGGTTTAT CGTTGACTGA TGGTCATCAG
CACGATCAGC AACGTAGCTT GCTCAAACCT GCTTTTGCCC ATGCCGCCCT TCAGCAACTG
CAACCACAAA TTCAACACCT GATTCGGCAA CAATTGCAAA CCTGGCCCGA TGCTGAACCG
ATTTGTTTGT TGGAACGCTT GCAAACGTTG GCGTTTGACG TGGCATTGCT CGTGGTTTGT
GGCCGCACGC CAGCCCCAAT TGCCGAGGCT TTGCATCATG ATTTTGCGGC GTTTACCGCT
GGTTTGTTTA CCCCGTTGCC CTATCCAATT CCAGCAACAC CTTATTTTCG GGCGCAAAAA
TCTGGCGAGC GGTTGCGCCA AACCTTAAGT TATTTAATTG AACTCCGCCG TTTGAACATA
GCAGCTGATG CCTTGGATAG CTTAAGTTTG ATGCTCCAAG CTGAGCCAAA TCGCCCTGAT
GATGAATTAA TTAGCGAATT ATTATTGTTG TTATGGGCTG GCCACGATAC CGTTGCCTCG
TTGCTGACTT GGATTTGTAT TGAATTGGCG CAACACCCCG AAATCCTGCA ACGTTTGCGT
CAAGAATTAA GTACTAATCA CCATAGTTTG CTCGATCATG TGCTGCGTGA GGCTGAGCGG
TTACATCCGC CAGCGCCTGG TGGTTTTCGC GGAGTTGTTG AAGCTTTTGA ATATGCTGGC
TATCATGTGC CGCAGGGCTG GCTGGCGATG TATTCATCAG TCTATACCCA CCAGATGCCA
AGTCTGTGGC ATAATCCTAC TCAGTTCAAT CCCAATCGCT TTGCAGCGCC TTGTAACGAA
GGCAAACAAG CCTATAGTTT GGTTGGCTTT GGCGGCGGGC CACGGATTTG CATTGGTTTG
GCGTTGGCTC AAATTGAAAT GCGCCTAGTG TTACGCGAAT TGCTCGCCAA TTATCAATGG
CAACTTGTGC CTAACCAAGA TTTACGGCCT GTCTGGTTGC CAACCAATCA GCCGCGTTCA
GGTGGCCTGA TCACAATTCA GCGGTTTTAG
 
Protein sequence
MKRRLAPQAG CPPGSFGAPV IGEIREWAAD PLQFAQARAQ RYGPIWSTHL LGRPCVVLLE 
PAGNRFMLSQ GLQYFSWRAG WGRAMLRLMG GGLSLTDGHQ HDQQRSLLKP AFAHAALQQL
QPQIQHLIRQ QLQTWPDAEP ICLLERLQTL AFDVALLVVC GRTPAPIAEA LHHDFAAFTA
GLFTPLPYPI PATPYFRAQK SGERLRQTLS YLIELRRLNI AADALDSLSL MLQAEPNRPD
DELISELLLL LWAGHDTVAS LLTWICIELA QHPEILQRLR QELSTNHHSL LDHVLREAER
LHPPAPGGFR GVVEAFEYAG YHVPQGWLAM YSSVYTHQMP SLWHNPTQFN PNRFAAPCNE
GKQAYSLVGF GGGPRICIGL ALAQIEMRLV LRELLANYQW QLVPNQDLRP VWLPTNQPRS
GGLITIQRF
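
The protein sequence follows from the gene sequence under translation table 11. The Python sketch below shows one way to reproduce that translation. It is a hedged illustration, not the method the database itself uses. It assumes the first codon is an accepted initiator (such as the GTG seen here) and is therefore read as methionine, and that translation stops at the first stop codon. The helper name `translate_cds` is hypothetical. The amino acid assignments are those of the standard code, which table 11 shares.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; hypothetical helper for illustration."""
    # Remove the spaces and line breaks used in the record's 10-base layout.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # Assumption: the first codon is a valid start and codes for Met.
        aa = "M" if i == 0 else CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from this record.
print(translate_cds("GTGAAACGGC GTTTAGCTCC TCAGGCTGGC"))
```

For these first 30 bases the result is MKRRLAPQAG, which matches the start of the protein sequence in the record. Passing the full gene sequence to the same function should reproduce the whole protein, provided the assumptions above hold.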