Gene Hoch_2967 details


Gene Information

Locus tag: Hoch_2967
Symbol:
ID: 8545355
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4079133
End bp: 4080437
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 73%
IMG OID: 646387644
Product: cytochrome P450
Protein accession: YP_003267372
Protein GI: 262196163
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
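
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end coordinates are inclusive and that the CDS length includes the terminal stop codon.

```python
# Values copied from the Gene Information record.
start_bp = 4079133
end_bp = 4080437
protein_length_aa = 434

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS holds one codon per residue plus one stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print(gene_length_bp, expected_cds_bp)
```

Under those assumptions both values agree with the reported gene length of 1305 bp.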


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00303691
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCGGTCG ATTCCACCAA CCGCCCCCCC GCGTCCGAAA CCGCTCACTC GTCCCCGACC 
GCCGGGGCGC TGCCGGCGGT GCTGGAGGGC TTCGATCTGA GCGATCAGCC GCGCTTCGCC
GACGGCTTCC CGTACGAGGT GTTCGCGCGC CTGCGCCGCG AGGCCCCGGT GCTGTTCCAC
CCGCCGGGCC AGACCAAGGA CGGCGAGGGC TTTTGGGTGC TGAGCCGGCA CGCCGATATC
TGCGAGGCGG CCGCGAGCCC GGCGTTCTCG TCCCAGGGGG GCGGCGGGCG GCCGCACGGC
GGCACGCACA TCGACGACGC GCGCCCCGAG CTGCCCGGCG TGCTGATCAA CATGATGGAC
GACCCGCGCC ACGCCGACCT CAAGGACGTG CTGTCGCCGG CCGTGGGCCG GCAGGCGCTG
GTCGCGCTCG AGGGCGCGCT GCGGCCGTAC GTGAACGAGC TGGTGGACGG GCTGCTGGCG
CGCGGTGAGG CCGAGTTCGC GGCCGACGTG GGCGCGGCCG TGGGCGCGCG CGCGATCTCG
CTGCTGCTCG GCATCCCGCG CGAGGACTGG CCGCTGTTCG CGACCTGGAC GTCGGCGCTG
ATGGGCTTTG ACGATCGCGA GACCGCCGAG CCGTCGGAGC GCAGCCAGAA GATCCACATG
GACCTGTTCG GCTACGGCGC GCGGCTACTG GTGGCCCGGC GCGCGGCGCC GCAGGAGGAC
CTCGGCTCGC TGCTGGCCAA CGCGCAGCTC CGGCGCGACT CCGAGCGACC GCTCACGGAG
CTGGAGCGGC AGACGGCGTT CTGCCTGATG GTGCTCGCCG GGACCGAGTC GACGCGCAAC
ATGATCGCGG GCGGCGTGCT GGCCCTGGCC CAGCATCCGG CGCAGTGGCA GGCGCTGCGC
GATGAGCGCT CGCTGCTGCC GAGCGCGATC GACGAGATCC TGCGCTGGAC CACGCCCACG
CCGTACAACC GGCGCACGGC GACCCGCGAC GTGACGCTCG GCGACGCCCA CATCCGCGCG
GGCGATAAGG TGACGCTGTG GTGGACCTCG GCCAACCGCG ACGAGTCGGT GTTCAAGGAC
CCGATGGCGT TCGATGTCCG CCGCGACCCC AACCCGCACC TGGCGTTCGG CTACGGCACG
CACTGCTGCT TCGGCGACCA GCTCGGCAAG CTGGAGATGC GCCTGGTGCT TGACGCGATG
CTCGAGCGCG TGGCGCCGCT CGAGCTGAGC GGGCCGGTGG TCTGGGCCGC CAGCAACAAG
CACACGGTGG TGATGGATAT GCCGGTGGCC GTGCGCGCGC GCTGA
 
Protein sequence
MSVDSTNRPP ASETAHSSPT AGALPAVLEG FDLSDQPRFA DGFPYEVFAR LRREAPVLFH 
PPGQTKDGEG FWVLSRHADI CEAAASPAFS SQGGGGRPHG GTHIDDARPE LPGVLINMMD
DPRHADLKDV LSPAVGRQAL VALEGALRPY VNELVDGLLA RGEAEFAADV GAAVGARAIS
LLLGIPREDW PLFATWTSAL MGFDDRETAE PSERSQKIHM DLFGYGARLL VARRAAPQED
LGSLLANAQL RRDSERPLTE LERQTAFCLM VLAGTESTRN MIAGGVLALA QHPAQWQALR
DERSLLPSAI DEILRWTTPT PYNRRTATRD VTLGDAHIRA GDKVTLWWTS ANRDESVFKD
PMAFDVRRDP NPHLAFGYGT HCCFGDQLGK LEMRLVLDAM LERVAPLELS GPVVWAASNK
HTVVMDMPVA VRAR
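
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how that might be done and how a GC percentage could be recomputed from the gene sequence; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first line of the gene sequence rather than the full CDS.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and newlines from a sequence printed in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record (six blocks of ten bases).
first_line = "ATGTCGGTCG ATTCCACCAA CCGCCCCCCC GCGTCCGAAA CCGCTCACTC GTCCCCGACC"

print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the complete gene sequence to `gc_percent` would give a figure that can be compared with the GC content field in the Gene Information section.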