Gene Hlac_0554 details

Gene Information

Locus tag: Hlac_0554
Symbol:
ID: 7401689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 576036
End bp: 577562
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 70%
IMG OID: 643707619
Product: cytochrome P450
Protein accession: YP_002565226
Protein GI: 222478989
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
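
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute a residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(576036, 577562)
print(length_bp)                  # 1527
print(protein_length(length_bp))  # 508
```

Both values agree with the Gene Length and Protein Length fields listed for Hlac_0554.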


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGCAACC CGATCACCCG CTCCGACGAC GATCGAGCGC GACAGTCCGG CGGTAGCGAA 
CGCACCGACA CCGCCGACGC CGACGTCGAC GTCGCGGATG CTCCGCCGCC GCCAGATCCC
GGCGGACTCC CGGTGCTCGG CAACGTCCAC GAGCTCGCGA GCGATGCGCT CGGGTTCTAT
GAACGCCGAT CCGCGGAGTA CGGTGGGATC GTCCGGTACG ACGTGTTCGG GACGGAGAGC
TACCTCGTGA CCGATCCCGG GGCGATCGGG CGGATCCTCG TCGAAGACCA CGATCGGTTC
GTGAAAGGCG AGATGCCTCG TGAGCAGCTC GGCGGCCTCC TCGGAGACGG ATTGTTCCTC
GCCGAGGGGG AGGCGTGGCG CGAGCAGCGC ACGGCGATCC GATCGGCGTT CTTCCGCGAG
CGGGTCGCCG CCTACGGCGA CTCGATGGTC GAACACGCTC GCGAGGCGGT GGATTCGTGG
GGGGACGGCG CGGTCGTCGA CGTGCACGCG GCGTCGACCG AGTACGCCTT CGCCGTCCTC
GCGGAGAGCC TGCTCGGAAG CGATATCGAG GGCGAGCGCG AGACCGTTCG GGCGGCGGCC
GAGGCTATCA CCGATCGGTT CGACATGAGT CGTCCGACCT CGTTCCTCCC CGAGTGGCTC
CCAACTCCCG CAAACCGACG GTACCGACGC CGGCTCGGCG CCCTTCGCGC GACGATACGC
GACCTCGTCA CTGAGCGGCG GGCCGCCGGC CCGCCCGCCG ACCCGAGCGC CGCGGACGAC
CTCCTCGGTA CCCTCGTCGC GGCCGCGAAG CTTGGTGCGC TCGACGACGA GGAGCTAGTC
GATAACGCGG TGACGTTCCT CTTCGCCGGC CACGAGACGA GCGCGCTCGG ACTGACGTAC
GCGCTGTACT GTCTCGCGCG CCGCCCCGCG TTTCAGGACC GCATTCGCTC CGAAATCGCA
CCACTCCACG GAGATCCGAC GCCCGCGGAC CTCCGCGAGT GCCCCGCGCT GACCGCGGCG
GTCGACGAGG CGCTCCGCCT GTATCCCCCG GTCCACTCCT TTTTCAGGGA GCCGACCGAG
CCGATCGCGC TCGGCGAGTA CCGCATCCCG TCCGGGGTCG TGCTCACGCT TGCACCGTGG
TCGGTCCACC GCGACGGGCG CTGGTGGAAC GCTCCCGAGA CGTACCGCCC CGAACGCTGG
CTCCGTGAGA CCGAGGACGG CGGTGTCGTG CACGGCGACG ACCGTTCCGG CCCGGCGGTG
GGGGAACACC CCGAACACGC CTTCTTCCCG TTCGGTGGGG GGCCGCGACA CTGCATCGGG
ATGCGCTTCG CGCGACAGGA ACTCCGGCTC GCGGTCGCGA CGATTCTCCG TCGGGTGCGG
CTGGAACCCG TGACGGAGGA ACTGTCGCTG CAGGCGAGCG CGAACACCCG ACCGGACGGA
CCGGTACACG TGCGGATCCT CACGCGAGAC GAGCCGTCCG ATCCAGAGTC GCTCGACCAA
GACGAGCCGT CCAAGACTAG TTGGTAA
 
Protein sequence
MCNPITRSDD DRARQSGGSE RTDTADADVD VADAPPPPDP GGLPVLGNVH ELASDALGFY 
ERRSAEYGGI VRYDVFGTES YLVTDPGAIG RILVEDHDRF VKGEMPREQL GGLLGDGLFL
AEGEAWREQR TAIRSAFFRE RVAAYGDSMV EHAREAVDSW GDGAVVDVHA ASTEYAFAVL
AESLLGSDIE GERETVRAAA EAITDRFDMS RPTSFLPEWL PTPANRRYRR RLGALRATIR
DLVTERRAAG PPADPSAADD LLGTLVAAAK LGALDDEELV DNAVTFLFAG HETSALGLTY
ALYCLARRPA FQDRIRSEIA PLHGDPTPAD LRECPALTAA VDEALRLYPP VHSFFREPTE
PIALGEYRIP SGVVLTLAPW SVHRDGRWWN APETYRPERW LRETEDGGVV HGDDRSGPAV
GEHPEHAFFP FGGGPRHCIG MRFARQELRL AVATILRRVR LEPVTEELSL QASANTRPDG
PVHVRILTRD EPSDPESLDQ DEPSKTSW
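
The gene and protein sequences above can be checked against each other by translating the gene sequence. The following is a minimal sketch, not the database's own method: `clean`, `translate`, and `gc_fraction` are hypothetical helper names, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code only in its permitted start codons). The space-separated 10-base blocks used in this record are stripped before processing.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as formatted in this record.
print(translate("ATGTGCAACC CGATCACCCG CTCCGACGAC"))  # MCNPITRSDD
# Last 27 bases, including the TAA stop codon.
print(translate("GACGAGCCGT CCAAGACTAG TTGGTAA"))     # DEPSKTSW
```

The two fragments reproduce the first and last residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the GC content field to be compared in the same way.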