Gene Htur_3940 details

Gene Information

Locus tag: Htur_3940
Symbol:
ID: 8744568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 195317
End bp: 197776
Gene Length: 2460 bp
Protein Length: 819 aa
Translation table: 11
GC content: 66%
IMG OID: 646514521
Product: aldehyde oxidase and xanthine dehydrogenase molybdopterin binding protein
Protein accession: YP_003405468
Protein GI: 284167190
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
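
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only illustrative and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based and inclusive, and that the gene length counts the 3 bp stop codon while the protein length does not. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 195317
end_bp = 197776
reported_gene_length = 2460      # bp
reported_protein_length = 819    # aa

# Assumption: Start bp and End bp are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the 3 bp stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)           # 2460 True
print(protein_length, protein_length == reported_protein_length)  # 819 True
```

Under these assumptions the reported coordinates, gene length, and protein length agree with one another.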


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAACC TGGAAGAGAC GCAGGGACGG AACGATAGAC CCGACGGCGT TCGGGCGGAG 
GAGGTGGCCT CGGAAACGGA CGCGTCTCCG GGGGAGGACG AGGTGGCCGA GGACGACCGG
AAACCCCGGG AGGAACGCGA ACATCTGACC GAGAACGTTG AGAAGGACGA CGCGCGGAAA
ATCGTCACCG GAGAGGCCCG CTACACGGCC GATTACCGCG ATCGCTTCCC CGAACTCGCC
GAAGGGAAAG TGATCCGAAG CGACATCGCG CACGGCTACG TCCGCGACAT CGACGTGAGC
GAGGCCGAAG CGATGGACGG CGTCTACGCC GTGATCACCC CGTGGGACGA CGTCGTCCCC
GATAAGGCGT ACTCGAGTTC GGGCCAGTCC TATCCCGAAC CGAGTCCCTG GGATCTGCGC
GTACTCCGCG AACACGTTCG GTATGTCGGC GACCCCGTCG CGGCGATCGC CGCGAAAGAC
GCGGAGACGG CCGACCGCGC CGCCCGGACG ATCGACGTCG AATACGAGGA GCGAGAGCCC
GTCCTCGACC CCGAGGAGGC GATGGAGCCG GACGCTCCAC AGTTGTTCGA CCCCGCGGAG
GTCGAGAACA AACAGCGCGG CGCCGACTAC GAGCGAAACC TCGAGTCTCA TTTCGAGGGC
GAGCGGGGCG ATGTCGAGCG GGCGTTCGAA CGAGCGGACG ACGACCGGGT GATCGAGACG
GAGTGGGAGA CGCCGTACCA GTCACACTGC GTCCCCGAAC CGCACACCAC GATCGTCCAC
ACGGACGAGG ATGACCGCTA CTCGTTTATC ACCGCGACGC AGGTCCCGTT CCACACCCGA
CGCCAGATCG CACACCTGTT CGACGTACCC ATCCGCGACG TCCGGGTCAC GAAACCGCGC
ATCGGCGCGG GGTTCGGCGC GAAACAGGAG ATGGCGATCG AACCGATCGC GTTCGCGCTC
CACCTGGCGG CGGACAAGCC GGTCAAACTG GAGATGACCC GCCGCGAGGA GTTCTACGCG
CTCCGGTTCC GCCACCCGAT GCGGATGCGC ATGCGGACGG CGGTGGACGA CGACGGAACG
ATCGAGGCGA TGGATCTGTA CGCGCTATCG AACTCGGGGG CGTACGGCAC CCATGGGATG
ACCGTCGCGA ACAACGTTGG AACGAAGCCG CTGCCGCTGT ATCCGCGCGT TCCGAACGTC
CGGTTCGAGG GGGACGTCGT CCACACGAAC CTTCCGATGG GCGCGGCGAT GCGCGGCTAC
GGCGCGCCGC AGGGCCACTT CGCGGTTGAG GCGCACATGG ACGAGGTCGC TCGCCGGCTC
GACCTCGACC CGATCGAATT CCGCGAACGC AATGCCGTTC GCGAGGGCGA CGTCGACCGG
AACGTCGCCA TCCTGAAAGA CGACGAGAAG TTCACCCGAG AGATACGTTC TTGCGGGCTC
CGCGAGTGCA TCGAACGCGG GAAGGAGGCC ATCGGTTACG ACGACCTCGA CCGACCCGAG
GAGGCGCACC GCTCCCGCGG CGTCGGCGTG GCGCTCATCG CGCAGGGAAG CGGCGTCGCC
GGGAGGGAGC TCGGTGCGGC CCAGATCAAG ATGAACGAGG ACGGTTCGTT TCACCTCCAG
GTCGGCGGCG TCGACACCGG CACCGGCTCC GACACGATGT TCAGCCAGGT CGCCGCGGAG
GTTCTCGGGT GCGAGCCGAC GGATATCGTC GTGATCTCAT CGGACACCGA TCTGACGCCG
TTCGACTACG GCGCGTACGC CTCCTCGACG ACCTACATCA GCGGCCAGGC CGTCAAGGAG
GCCGCCGAAG ACGCCAGGGA ACGGCTGGTA CACTGGGGAT CGAAGATGCT CGACGAACCC
GTCGAGAACC TGCGGACGGG CGACGGCGAG GTGTACAGCG AGGTAACCGA CGAGAGCGTT
TCGCTGGAGG AGATCGGGTA CGAGGCGACC TACGGCCACG ACGACCGCGA ACACATTCTC
GGGAAGGGAA ATCACTCGAC GGACGAGAGT CCGCCGCCGT ACGGCGCCCA GTTCGTCGAC
GTGACCGTGA ACGAGGAAAC CGGCGAATAC GACATTAATA AGATGGTGTT CGCCGCCGAC
TGCGGCGTCG CGATCAACCC CGCGCTCGTC GAGGGGCAGA TCGAGGGGGG AGAGCACATG
AGCCTCGAAT TCGCGACCAG CGGCGGACTG ACGTTCGACG AGGAGGGGAA CCCCGAAGTA
CTCGGTTTCC GCCAGTACGG CATGCCGCGG ACGACGGACC ACCCGCCGAT GGAGACGATC
ATCGTCGAAA CGCACGAACC GACCGGACCG TTCGGTGCGA AATCGATCGC CGAACTCCCG
ACGAACGGCG TTCCACCGGC TCTCAGCAAC GCCGTCCGAG ACGCCGTCGG CGTCCGGGTC
AACTCCCTAC CCATCACGGC CGACGATATC AAACGGGCCC TCGAGGAACG GGACGGGTAG
 
Protein sequence
MSNLEETQGR NDRPDGVRAE EVASETDASP GEDEVAEDDR KPREEREHLT ENVEKDDARK 
IVTGEARYTA DYRDRFPELA EGKVIRSDIA HGYVRDIDVS EAEAMDGVYA VITPWDDVVP
DKAYSSSGQS YPEPSPWDLR VLREHVRYVG DPVAAIAAKD AETADRAART IDVEYEEREP
VLDPEEAMEP DAPQLFDPAE VENKQRGADY ERNLESHFEG ERGDVERAFE RADDDRVIET
EWETPYQSHC VPEPHTTIVH TDEDDRYSFI TATQVPFHTR RQIAHLFDVP IRDVRVTKPR
IGAGFGAKQE MAIEPIAFAL HLAADKPVKL EMTRREEFYA LRFRHPMRMR MRTAVDDDGT
IEAMDLYALS NSGAYGTHGM TVANNVGTKP LPLYPRVPNV RFEGDVVHTN LPMGAAMRGY
GAPQGHFAVE AHMDEVARRL DLDPIEFRER NAVREGDVDR NVAILKDDEK FTREIRSCGL
RECIERGKEA IGYDDLDRPE EAHRSRGVGV ALIAQGSGVA GRELGAAQIK MNEDGSFHLQ
VGGVDTGTGS DTMFSQVAAE VLGCEPTDIV VISSDTDLTP FDYGAYASST TYISGQAVKE
AAEDARERLV HWGSKMLDEP VENLRTGDGE VYSEVTDESV SLEEIGYEAT YGHDDREHIL
GKGNHSTDES PPPYGAQFVD VTVNEETGEY DINKMVFAAD CGVAINPALV EGQIEGGEHM
SLEFATSGGL TFDEEGNPEV LGFRQYGMPR TTDHPPMETI IVETHEPTGP FGAKSIAELP
TNGVPPALSN AVRDAVGVRV NSLPITADDI KRALEERDG
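
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: `clean` and `translate` are hypothetical helper names, and the codon table is built from the standard genetic code on the basis that translation table 11 uses the same amino acid assignments as the standard table (the two differ in which start codons are permitted). Only the first line of each sequence is used here.

```python
# Codon table in the conventional TCAG ordering (standard amino acid assignments,
# which translation table 11 shares).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a cleaned DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the matching start of the protein sequence.
first_gene_line = "ATGAGCAACC TGGAAGAGAC GCAGGGACGG AACGATAGAC CCGACGGCGT TCGGGCGGAG"
first_protein_blocks = "MSNLEETQGR NDRPDGVRAE"

translated = translate(clean(first_gene_line))
print(translated)                                 # MSNLEETQGRNDRPDGVRAE
print(translated == clean(first_protein_blocks))  # True
```

Passing the full gene sequence through the same two helpers should give the full protein sequence, provided the blocks are copied without alteration.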