Gene Htur_0042 details


Gene Information

Locus tag: Htur_0042
Symbol:
ID: 8740605
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 48410
End bp: 50626
Gene Length: 2217 bp
Protein Length: 738 aa
Translation table: 11
GC content: 63%
IMG OID: 646510605
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: YP_003401616
Protein GI: 284163337
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
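
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(48410, 50626)
print(length)                     # 2217
print(protein_length_aa(length))  # 738
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.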


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCCA TAGCACAGTC GGAGGTGAGC AGGGAGACCT ATCTGGGGCT CGGCGGCGTG 
GAATACGTGC TGTTCTACTT CCTGGTCGCA GTGACCCTTG CCGTCTTCGC TTACGGCGTC
TACCGACGGT TCAGTCGGTA CACCGAAGGT GACGACGATC CGTTCGCTCG GGTCGACGGA
CTGCTGAGCC GAACCGTCAA CGCGGCGAAG ATCGTCATCA CGAACGAGAA ACAGTTCAAC
AGGGATCTCT ACGGCGGCCT GATGCACTCG TTTATCCTCT GGGGATTCCT GACGCTGCTC
ATCGCGACGT CGATCATCGC CGTCGAGCAG TACGGAACCG AACTGTTGCT CGGGTTGACG
TTCTGGGAGG GCGACTTCTA TCTCACCTAT CAGTTCATCG TCGACGCGAT GGGCCTGCTC
TTCGTCGTCG GGATCGGAAT GGCCCTCTAC CGACGGTACT GGGTCCAGAA TCACCGCCTC
TGGGGCCGGC ACACCTCTAA CGAGGACGAT ATCTTCATCT GGACGCTGTT CGGCCTCGGC
ATCGGCGGCT TCCTGCTCGA GGGCTTTCGC ATCTACATCA GCGGCATGCC GAGCCACGAG
GTCGTCAGCT TCGTCGGTTA CGGGCTGGCG ATGGCCTTCG ACGCGGTCGG CCTGCCGACG
ACCGGTGCGG TGCTGGAGGG CGGCGCGCTC AATCCCGACT ACGCCGCAGT CGATCAACTC
GGCTTCAACG CCGAGACGCT CCACTGGCTG ACGTGGTGGT CCCACTCGCT GCTCGCGCTC
TTCTTCATCG CGTGGATCCC CCACGCCGGG AAGCCGTTCC ACATGCTCTC CTCGTTCGCG
AACGTCGTCA CGCGCGACGA GAAGGCCGGT CGGCGCCTGC CTAACGTCCC GTCGGATCTG
GACGCCACGA ACGCCGAGTC CATCGACGAC TTCACCTGGA AAGAGATTCT CGATCAGGAC
GCCTGTACCA AGTGCGGTCG CTGTTCGTCG GTCTGCCCCG CAAAGGCCTC CAATCGCCCG
CTCGATCCGC GAGACGTCAT CCTCGACCTC CGCAAGTATC GCGAGGAACT CGAGGCCGGC
GGCGAGGAGA AGCCGATCAT CGCCGACGGG GGGACCTCAG TGATCAACAC GGAGACCATG
GAGTCCTGTA TGGCCTGTAT GGCCTGTATG GACGCCTGTC CCGTCGAGAT CGAACACCTC
CAGTCCTTTA CCCGGCTCAA CCGGCAGATG ACCGACCAGG GCGACGTCGC CCCCAGCATG
CAGGACGTCT TCCAGAACGT CATGCAGAAC GGCAACACCT TCGGCGACTC GCCGCGCAAT
CGCGGCGATT GGAGCGAGGG CCTCGAGTTC GACGTGCCCG ACGCCCGCGA GGAGACGGTC
GACTACCTCT GGTACGTCGG CGACTTCCCG AGCTACGACG AGCGCAACAA GCAGGTCGCC
CGCTCGCTGG CGACCATTCT GAAGGAGGCC GACGTCAGCT TCGGCATCCT CTTCGACGAC
GAGAAGTACG ACGGCAACGA CATCCGCCGC GTCGGCGAGG AACTGCTCTA CGTCGAACTA
GCCGGCCACC ACGTCGAGAC CTGGGAGGAC TGCGAGTTCG ACAAGATCGT CTGTACGGAC
CCCCACTCCT ACAACACGTT CAAAAACGAG TATCCGGAGG TCAACTTCGA CGAGTTCGCC
GACGACCCGA TGATGCCCTT CGAGTACGAC GAGCAGTGGA ACGAGGACGG CGAAATCGAG
GTCTACCACT GGACCCAAGC CGTCGAGGAG CTAGTCCGAG AGGGCAAACT CGAGCTGTCG
GGCACCGAAC TCGACTACAC GGTCACCTAC CACGACCCGT GCCACCTCGG CCGGTACAAC
GACGAGTACG AAGCGCCCCG GGAACTCATC CGCGCCACGG GCTGTGAACT CGACGAGATG
CCCCGCAACC GCGACAACTC CTTCTGCTGT GGCGGCGGCG GCGGCGGCCT CTGGATGGAC
TTCGAGGAAG AGCCAAAGCC CAGCGAAGAG CGGATTCGGG AAGCCCTCGA GGACACCGAC
AACGGCCCCG GCGTCGAGAA GTTCGTCGTC GCTTGCCCGA TGTGCATGAC AATGTACGAG
GACGGGCGCA AGACCGGCGG CTACGAGGAC GAGATCGAGA TCGTCGACGT CGCCGAACTC
ATCGTCGAAG CGATCGGCGC CGAGGAGGAA GCGAATCTCG AGGTCGCGGC GGACTGA
 
Protein sequence
MNAIAQSEVS RETYLGLGGV EYVLFYFLVA VTLAVFAYGV YRRFSRYTEG DDDPFARVDG 
LLSRTVNAAK IVITNEKQFN RDLYGGLMHS FILWGFLTLL IATSIIAVEQ YGTELLLGLT
FWEGDFYLTY QFIVDAMGLL FVVGIGMALY RRYWVQNHRL WGRHTSNEDD IFIWTLFGLG
IGGFLLEGFR IYISGMPSHE VVSFVGYGLA MAFDAVGLPT TGAVLEGGAL NPDYAAVDQL
GFNAETLHWL TWWSHSLLAL FFIAWIPHAG KPFHMLSSFA NVVTRDEKAG RRLPNVPSDL
DATNAESIDD FTWKEILDQD ACTKCGRCSS VCPAKASNRP LDPRDVILDL RKYREELEAG
GEEKPIIADG GTSVINTETM ESCMACMACM DACPVEIEHL QSFTRLNRQM TDQGDVAPSM
QDVFQNVMQN GNTFGDSPRN RGDWSEGLEF DVPDAREETV DYLWYVGDFP SYDERNKQVA
RSLATILKEA DVSFGILFDD EKYDGNDIRR VGEELLYVEL AGHHVETWED CEFDKIVCTD
PHSYNTFKNE YPEVNFDEFA DDPMMPFEYD EQWNEDGEIE VYHWTQAVEE LVREGKLELS
GTELDYTVTY HDPCHLGRYN DEYEAPRELI RATGCELDEM PRNRDNSFCC GGGGGGLWMD
FEEEPKPSEE RIREALEDTD NGPGVEKFVV ACPMCMTMYE DGRKTGGYED EIEIVDVAEL
IVEAIGAEEE ANLEVAAD
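
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, with a GC-fraction calculation and a rough completeness check; all function names are hypothetical, and the ATG-only start check is a simplification (translation table 11 also permits alternative start codons).

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Rough check: whole codons, ATG start (simplified), recognised stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in stops


# First three blocks of the gene sequence listed above.
fragment = "ATGAACGCCA TAGCACAGTC GGAGGTGAGC"
seq = clean_sequence(fragment)
print(len(seq))                      # 30
print(looks_like_complete_cds(seq))  # False: the fragment has no stop codon
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field of the record.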