Gene Htur_4166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4166 
Symbol 
ID8744794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp436207 
End bp437703 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content60% 
IMG OID646514715 
ProductCarboxylyase-related protein 
Protein accessionYP_003405662 
Protein GI284167384 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCA GTAGCCTCCG GCAGTACCTC CAAACGCTCG AGACGAACGG AGACCTCCAT 
CGAATTAGCG AGCCGGTCTC GTGGAATCTC GAGGCAAGCG CCGTCACGAT GCTGCTGAAC
GAAGAAGACA GCGCCGTGCC GCTATTCGAG AACGTCGATT CAGCGCGACT CGTCGGCGAC
CCGTATCGGG GAACCCAACG ACGACCCTGG GAACGGATCG CCCTGGGACT CGGATTGCCG
TCGGATCTCT CGTACAGAGA GTTCTACGAA GCGGTGATCG AACGGCTGAA AAACCCGATA
GAACCGGTAA CAGTATCCAC AGACGATGCG CCCTGTAAAG AGGAGATACA GACGGGCGAC
GACGTTGATC TCCTGGACTT TCCCTGGCCG TACATTCACG CGGGCGACGG CGGACGCTAT
TCGAATCTCC ATACGCTCGT CGCACCGGAC CCTGATTCCG AGTGGGTCGA CTGGTCGAAC
CATCGAACGA TGATCCACGA CGGCGAGACG AGCAGCGTCC TCCTGTTGGC GGGTGAGCAA
ACGCCGAATC TCTACTATTA CAAGTACGAG AAACGGGACG AACCGATGCC GGTCGCGATC
ACCGTCGGCG CTGAACCTGC CGTTCAGTAC ACGTCCGTGA TGTGGATTCC GACGGGACGA
AACGAAGCGG AATACGCAGG GGGATTGAAA CAGGAACCGG TTGAACTCGT ACCCTGTGAA
ACCAACGACC TCTCGGTCCC AGCGACGGCC GAACTCGTCA TCGAGGGCGA AATCCTCCCG
AACGAGCGTC GTGACGAAGG ACCGTTCGGC GACTACTTCG GCTATATGCA CGGCCCCAGA
CGGTCGATGC CTTTGTTCCG AGTGACCGGA ATTACTCATC AAACCGATCC GATACTCCCG
TTCTGCGTCG AGGGGACCGG TGTCGGGCAT TCGGAAAACA CAACCAGTTC GATGGAAATC
GGCTGTGTCG GGCCGGACGC AACGCTCGGA CTGCGGACCG CCGGGTTCGA CGTCGAATGC
TGTGCCCCTT GGAAGTCGAC GCCGAGGACG ATCTACGCGA TCTCGACCGA GAAGACCAAC
CCCAGCTATC TCCACGATAT GGCGAATTTC ATCTTCACGA CGTGGGGAAT GCTCCACGTC
GACTTCTTCA TCTTCGTCGA CGCTGACGTC AACCCGCTCA ATCAGCGCGA AGTGCTCGAG
GCGCTCGCCC TCCACGCGGA TCCCGACGCG GATTTCCATC AGTTCGGCGT CGAGACGATG
CCGAAGGTGC CGCTCAACAT CTATCAGACG CCGACCGAGA AGGGGGACAT CCAGACCGGA
ACGTCGAAAG CGAAGACGGC GAAGGCGTAC ATCGACGCGA CCAGCGACGG AGCTGGCCGG
GAGGCGCAAC CGACCCACGA CATCGAGCGC AGATATCGGG CGCAAAAGAT ACTGGAACGG
GCCGGCGTCG AATCGAGCGA GCTGTCGTTC GTCGATCCCG GGGAGGCCAC GCAATGA
 
Protein sequence
MTTSSLRQYL QTLETNGDLH RISEPVSWNL EASAVTMLLN EEDSAVPLFE NVDSARLVGD 
PYRGTQRRPW ERIALGLGLP SDLSYREFYE AVIERLKNPI EPVTVSTDDA PCKEEIQTGD
DVDLLDFPWP YIHAGDGGRY SNLHTLVAPD PDSEWVDWSN HRTMIHDGET SSVLLLAGEQ
TPNLYYYKYE KRDEPMPVAI TVGAEPAVQY TSVMWIPTGR NEAEYAGGLK QEPVELVPCE
TNDLSVPATA ELVIEGEILP NERRDEGPFG DYFGYMHGPR RSMPLFRVTG ITHQTDPILP
FCVEGTGVGH SENTTSSMEI GCVGPDATLG LRTAGFDVEC CAPWKSTPRT IYAISTEKTN
PSYLHDMANF IFTTWGMLHV DFFIFVDADV NPLNQREVLE ALALHADPDA DFHQFGVETM
PKVPLNIYQT PTEKGDIQTG TSKAKTAKAY IDATSDGAGR EAQPTHDIER RYRAQKILER
AGVESSELSF VDPGEATQ