Gene Hoch_2973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2973 
Symbol 
ID8545361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4119152 
End bp4120555 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content74% 
IMG OID646387650 
Productcytochrome P450 
Protein accessionYP_003267378 
Protein GI262196169 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0248286 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000662302 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCGAGA CCGTTGACCC CAGCTCAGCG CAGACCGCCG GCGATCTCGG CCGCGGCCCG 
TCGAGCGCCT TCTTGACCAC GCTGCGCTAC TCGATCGACC CGGTGAGCCT GTACCGGACC
TTGGCCCGCG ACTACGGCGA CGGCCACACC GTGCACATGC CCATGCTGCT GGGCGATGTG
GTGGGCGCGA TCAGCCCGCA GAGCGCGCAG GACATCCTCA CCGCCGACGC CGCGAGCTTC
GACATCTTCA GCCCCGAGTC GCTGGCCATG GTCTTCCGCC CGCGCTCGGT GGTCATGCTC
TCGGGCCAGG AGCACGCGCG CGAGCGCAAG CTCTTGATGC CGCCCTTCAG TCCCCGCCAG
GTGCTCGCCA ACTACGCCGG CACCATGCAG GAGACCGCGC TGGCGTACGC CGCCGAGGTC
GCCGACGGCC GGCCCTTCGT CATGCAGGAG CTGGCCCAGC GGGTGCTGCT GCGGGTGGTG
GTGCGCGACG TCTTCGGGGT CACCGAGGAC GCCGAGCTCG ACGAGCTGGA GCTGCGCATC
CGCGAACTGT TCGAGGCCTC GTCGCCGGCC CTGATCTTCT TCCCGCCGCT GCGCCACCGC
TTCGGCGGCG TCGGCCCCTA CGCCACCTGG GAGCGCGCCG ATCGGCGGCT CACGCGCCTG
ATCCACGACC TCATCGCGCG CCGCCGCGCC GAGCCCCGCG GCGACCGGGT CGACGTCCTG
TCGCTGATGC TGTCGGCCCG CTACCCCGAC GGCAGCGCGC CCAGCGACGA GGTCCTGCAC
GACGAGCTGA TGGCCCTGTT CTTCGCCGGC CACGCGGCCA CGGCGACCTC GATCGCGTGG
GTGTTCTACT GGGCGCACCG GCACCCCGAG GTGCTGCACA AGCTGCGCGA CGAGCTGGCC
GCGCTGCCCT GGGACGCCGA GCCCGCGCGC TACACCGAGC TTCCCTACCT CGACGCGGTG
TGCAACGAGA CCCTGCGCAT CTACCCGCCG GTGGCCGACC TGTACCGCAA GCTGCGCGTG
CCGCTGCGCG TCGGCGGCCG CACCGTCCCG GCCGGCACCG GCGTGGCCGT GCTGGTCACC
TGCATCCACG CGCGCCCGGA GCTGTTCCCC GAGCCGCTGC GCTTCCGCCC CGAGCGCTTC
GCCGAGCGCC GCTACAGCCC GTTCGAGTTC CTGCCCTACG GCGCCGGCGC GCGCCGCTGC
CTGGGCGCGT CCTTTGCCCA CCAGGCGCTG CAGGTCGTGG TCGCCACGAT CCTGCGCCGC
TGGGAGCTGG CGCTTGTGCG CCGCGAGCAG AAGGCCGTGC GTCAGGGCGT CGGCGTCGGG
CCCAAGCACG GCGTGCCCAT GCGCGTGGTG CGCTCGCTCG TGCCCGAGCG CAGCGCGGCG
CAGCCGGCCG GGGGGCGCCC GTGA
 
Protein sequence
MPETVDPSSA QTAGDLGRGP SSAFLTTLRY SIDPVSLYRT LARDYGDGHT VHMPMLLGDV 
VGAISPQSAQ DILTADAASF DIFSPESLAM VFRPRSVVML SGQEHARERK LLMPPFSPRQ
VLANYAGTMQ ETALAYAAEV ADGRPFVMQE LAQRVLLRVV VRDVFGVTED AELDELELRI
RELFEASSPA LIFFPPLRHR FGGVGPYATW ERADRRLTRL IHDLIARRRA EPRGDRVDVL
SLMLSARYPD GSAPSDEVLH DELMALFFAG HAATATSIAW VFYWAHRHPE VLHKLRDELA
ALPWDAEPAR YTELPYLDAV CNETLRIYPP VADLYRKLRV PLRVGGRTVP AGTGVAVLVT
CIHARPELFP EPLRFRPERF AERRYSPFEF LPYGAGARRC LGASFAHQAL QVVVATILRR
WELALVRREQ KAVRQGVGVG PKHGVPMRVV RSLVPERSAA QPAGGRP