Gene Htur_3729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3729 
Symbol 
ID8744355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3843621 
End bp3844961 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content68% 
IMG OID646514316 
Productcytochrome P450 
Protein accessionYP_003405264 
Protein GI284166985 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACG CACGACCGTC GACGTCCTCG CTTCCGGGCC CTCGGGGCCT CCCGTTCGTC 
GGAAACACGA TCTCGTTCGC CCGCGAACCG CTCGCGTTCC TCGAAGCGAT CCGCGAGTAC
GGCGACCTCG CCCGGTACGA GGCGTTCGGT CGCGAGTTCG TCGTCGTTTC CCGCCCCGAT
CTCGTCGAGG CGGTGCTGGT CTCCCGGAGC GACGAGTTCT GGCGGGGATC GTTCGAACAC
GAGTTGGGCG AGGGAGTCGG TATCGAAGGC GTGTTCTTTT CCGAAGGCGA GCAGTGGCGA
CGGCAGCGAC TGCTCTTACA GAACGCGTTC ACGCCGGTGC GGATCGAATC GTACGCCGAG
GTCATGGTCG ACGAGACCGT CCGAGAAGTC GCCAGCTGGC CTGAGGGAGG AGTCATCGAT
GTGAACGAGC GGCTGTCGGC GCTGACTCTC GGCGCGCTCA CGCGGTCGCT GTTCGCCCTC
CCGCTCGAGG GCGACCGCGC CGACCGCGTG CGACGCTGGG TCGACGCCAT GGGCGCGTAC
CTCGAAGCCG ACTTCTTCGG GCCGGGTGCC GTGTTGCCAT CGTGGTTCCC GCGGCGAACC
GAACGCGAGT ACGAGCGCGC CACAGCCGAC GTCGAAGCGC TCGTCGGGGA CCTCCTGACG
GAGCGTCGGG AATCGGACGC CGAGGGTGAC GACCTCCTCT CGCTGCTCGC GACGGCCGAG
TACCCCGACG GGACCCGTCC CTCGGCCGAC GAGATCTCCG ATCAGCTGTT GACGTTCCTG
CTCGCGGGCC ACGAGACGAC CGCGACCGCG CTCACCTACG CCTGCTGGTT CCTCGCGGCC
GACGACGAAA TTCGGGAACG GCTCGAGCGG GAGGTCGAGG CCGTCTGCGG CGACCGCGAC
CCGACGTTCG CCGATCTCCC GGAACTGACC GTCGCCGAGG CCGTCGGCCG TGAAGCGTTG
CGACTCTATC CGCCGCTGCC GTTCCTCCAT CGAGAGCCCC GCGAGCCGAC GTCCCTCGAC
GGCGTCCGCG TCGCGCCTGG AACGACGATC CAGCTGAACA TGTACGGGAT CCACCGTGAC
GAGCGCTGGT GGGCGGCTCC CGATTCGTTC CGTCCGGAGC GCTGGCTCGA CGACGCCGAT
CGACCCGAGT ACGCCTCCTT CCCCTTCGGC GGGGGACCGC GACACTGCAT CGGCATGCGA
TTCGCGATGA CGGAGCTCAA ACTGTCGCTG GCGACGATCG CGCGTCGGGT CCGATTCGAC
CGCGTCTCGA CGTCGCTCGA TCCGTCGATC GAGGTCTCGC TCGATCCCGG GACCGTCGAA
ATGCGGGTTC GCCGACCGTA G
 
Protein sequence
MSDARPSTSS LPGPRGLPFV GNTISFAREP LAFLEAIREY GDLARYEAFG REFVVVSRPD 
LVEAVLVSRS DEFWRGSFEH ELGEGVGIEG VFFSEGEQWR RQRLLLQNAF TPVRIESYAE
VMVDETVREV ASWPEGGVID VNERLSALTL GALTRSLFAL PLEGDRADRV RRWVDAMGAY
LEADFFGPGA VLPSWFPRRT EREYERATAD VEALVGDLLT ERRESDAEGD DLLSLLATAE
YPDGTRPSAD EISDQLLTFL LAGHETTATA LTYACWFLAA DDEIRERLER EVEAVCGDRD
PTFADLPELT VAEAVGREAL RLYPPLPFLH REPREPTSLD GVRVAPGTTI QLNMYGIHRD
ERWWAAPDSF RPERWLDDAD RPEYASFPFG GGPRHCIGMR FAMTELKLSL ATIARRVRFD
RVSTSLDPSI EVSLDPGTVE MRVRRP