Gene Haur_4177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4177 
Symbol 
ID5736038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5326221 
End bp5327600 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content51% 
IMG OID641281331 
Productcytochrome P450 
Protein accessionYP_001546937 
Protein GI159900690 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00497532 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTTA AGATGTTACC TGGCCCCAAA GGAACCCGCC TCGGTGGAAG TCGGCGCGAT 
TTACATAAAT ATGGCCCGTT GGGCTTTTTC GAATATCTAG CAAGTTTTGG CGATTTCACG
ACCTGTCGCA TGGGGCCGTT TCGGGTCTAT CTGGTCAATG ATCCGGCTGG GATTCAAGAG
CTTTTGGTGA CCAATCGCGA TAAGGTGCGC AAAAATGGCG GCGATCGCGA GTTGCTTTCG
CGCTTTTTAG GCAATGGTTT GCTCAGCAAT GATGGCGCTG ATCATCAAAA GCAGCGCAAA
TTGGTTCAGC CTGCGTTTCA TATGAAGCGC ATTCAGGCCT ACGCTGAAAC CATGGTTGAG
CATACCCAAG CCATGCTCGA ACGTTGGCAC GATGGCGCGA TTCTGGATAT GGATCAGGCC
ATGATGGAAT TGACCTTGAC GATTGTGACT AAAACCCTCT TCAATGCAGA CATTAGCGAA
CAAGAAGTGC GCCAAGTTAG CCAAGCCATG GAAGATATTC AGGTTAACTT TACAATTATC
TCGGAGCAAA GTGTACCGCT GCCGCGCTGG GTTCCAACGC GGGCTAATCG GGCGCTGGAA
CATGCCAGCA AACAGATCGA TCAAGTGGTG CAGCGGGTGA TTCGCGAACG CCGTGCCAGT
GGCGAGGATA CTGGCGATCT CTTGTCGATG TTATTGCTCT CAATCGATGA TGGCAATGGC
CAAGGCATGA CCGACCAACA AGTGCGCGAT GAAGTAGTGA CACTGTTTTT GGCTGGTCAC
GAAACCACTG CCAATACCTT AACTTGGTGC AGCTACTTGC TCAGCCAAGC GCCTGAGGTG
CGCCAACGCT TGCAAGCCGA AGTTGATGAG GTGTTGCAAG GCCGCCCAGT TACTTTGCAA
GATTTGCAAA AATTGCCCTA TACTGAAATG GTGATCAAAG AGACCTTGCG CATGTATCCG
CCGGCTTATG CCTTGAGTGC CCGCGTGCCA ACCGAAAATA TTACGGTGCT TGGCCAAACG
ATTACCCCAC GTCAGGCCGC CATGGTTTCG CCCTATGCTA TGCATCATAA TCCGCGTTAC
TGGCCTGAAC CAGAGCGCTT CGACCCTGAA CGATTTAGCC CAGAGCAAGA ACGGGCACGC
CATAAATATG CCTATATTCC ATTTGGGGCT GGCTCACGGG TCTGCATTGG CAACGTTTTT
GCCATGATGG AAGCCCAATT ATTGTTGGCA ACCATGATGC AGCATTATGA TTTCACGCTT
GATCCAACCC AACGAGTCGA GTATGATCCG CAAATTACCT TAGGGGTGAA ACATGGCTTG
CGGGTACGTT TAGCTCAACG CCAACCAGTG GAGCAAAGCC TCGAATTTGC AAAAAGCTGA
 
Protein sequence
MTVKMLPGPK GTRLGGSRRD LHKYGPLGFF EYLASFGDFT TCRMGPFRVY LVNDPAGIQE 
LLVTNRDKVR KNGGDRELLS RFLGNGLLSN DGADHQKQRK LVQPAFHMKR IQAYAETMVE
HTQAMLERWH DGAILDMDQA MMELTLTIVT KTLFNADISE QEVRQVSQAM EDIQVNFTII
SEQSVPLPRW VPTRANRALE HASKQIDQVV QRVIRERRAS GEDTGDLLSM LLLSIDDGNG
QGMTDQQVRD EVVTLFLAGH ETTANTLTWC SYLLSQAPEV RQRLQAEVDE VLQGRPVTLQ
DLQKLPYTEM VIKETLRMYP PAYALSARVP TENITVLGQT ITPRQAAMVS PYAMHHNPRY
WPEPERFDPE RFSPEQERAR HKYAYIPFGA GSRVCIGNVF AMMEAQLLLA TMMQHYDFTL
DPTQRVEYDP QITLGVKHGL RVRLAQRQPV EQSLEFAKS