Gene Htur_5037 details

Gene Information

Locus tag: Htur_5037
Symbol:
ID: 8745843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013748
Strand:
Start bp: 24498
End bp: 26849
Gene Length: 2352 bp
Protein Length: 783 aa
Translation table: 11
GC content: 61%
IMG OID: 646515651
Product: hypothetical protein
Protein accession: YP_003406598
Protein GI: 284176322
COG category:
COG ID:
TIGRFAM ID:
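
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 24498
END_BP = 26849
GENE_LENGTH_BP = 2352
PROTEIN_LENGTH_AA = 783


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for the values listed: 26849 - 24498 + 1 = 2352 bp, and 2352 / 3 - 1 = 783 aa.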


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTCAC AACATCACCT TCACCAACCG AGTGACAACA CCGAGACGCG AGTTGCGGTC 
CGGAACAAGC TCTCGGTCCG GATCGAACGC CAACGGCGTA AAGAGACGCT GAATACAGAG
GCCCATGTCC GCGAGTTAAT CCGCGACGCC GAGGGCGCGA TCGTCGGCGA CGACGAGACT
GACCGCCGCG CCGAGGAACT GCTCGAGGAG TTGCACCTCT ATCTGTGGCA TGACCCCGAA
CGAAAGGCAC CAACCGACGG CGATACGCTC CCGGCAGACG CTCGGTACCG GTTCAAACAG
ATTCACCGCC AATTCCAGGC CGGCCACGAT ATCGAATCGC GGACATGGTT CGAAACGCTG
GTGTCGGTCG CGAAAGGCCT CAACTACGTC GAAGCGTTTA CCGATCTGAC CCAGTACGCA
CCGGTCCGTC TCGAGACGTT AAACGAACAG GGCCGCCAGG GTAACGTCGA GACGGCGACG
CCGATCGGTC GCCGTCGGAT CGGGGCCGAC GACGCCGCCG ACCTCGAAGA CCGAGCCGTC
GAGATTACAC ACTCGTCGTG TGACCACATT CTCGCGGTTG CACTCCCGCG GTCGGGGAAG
GACTCGACGA TCACCAGTAT CGGGATGAAC CTCTGGAAAG AGCACAACTA CTCGTATTTC
AGCATTCTCG ACGACGGCCG GATGGAGACA CCAATGGTCT CGATACCCAA CGACGAGGAT
GCGATTCAAC GAAATCTCGA GCGGATGGGG CAGGAACCAG AGGCGTTCGA CGCCGAGGTG
TTCGTCCCGG CGATGGACGG CATTCCGTCA CACCTCCCGG CGAACTTCAA GCCGTTTACC
ATCGGGATTG ACGACCTCAC ACCCCATCTG ATTCTCCGAC TCGCGGGGAT TACGAAATCC
GACGCGAACA CGGAACAGCG GATCAAACAG GCACTCGATA AGACCCTCGA GCGCACCGGC
GAAGTGACTG AGTTGGTCTC GCGGTTGCAG GTCTACGCGA AGGAAATGGA CGCGACAATC
GAGTGGGTCG AGAAAGGCGA CGACTCGGTC GAATCCAAAA GCGTTCAATA CCACATGGAG
GCGGAGGATG CACTAAACAA GGCCGCCCAG CGGCTTGCTC ACCTCGCTGG CGAGGGGCTC
GTTGCGTCGC CCAGCGCCGA GACGAACATC GATATGGCCT CTGTCATCGC GAACCAGGAA
CAGGCCGCCG TCCTCTGTTG TAACTTTCTC ACGCAAGGGC AAGAGGCGCT GAAATACACG
ATTATGGACC TCTGGTTGCG GTTGATCTAT CAGGCCCGAG ACGGTGATAT GCGACTCCCG
CGTGTCTGCC TCGAGATTCG CGAACTGAAG AACATCGCCC CGAGTAAGCT GGCCGACGTT
CGGTACAAAG AGACCATCAA GACGCTTCGA CAGACCATCT TCTTTATTTC AACACAGGGT
GGCAGCCGCC GAATCTTGAT GCTTGGCTCG ACCCAGAAAC TGAACGACGT CTACAAGCCG
GTCCGGTCGA ACATGGCGAC GAAGATTCTG TTACGACTCG GCGAGGAGGA AATCGAGACG
CTGGACCGGT CGTACAATTT CAGTAACGAA CAGATGCGAC AGCTCTCCGA GTTCGATATC
GGACAGGGGA TGATCCTTGC CGGCGGCGAG GCCTACTGGC CGATCGAACT CCGCGGGGCG
CCGTGTGGGC TCGGACTCGG CGATCGCCAT TGGCTCGACC GGTATGGCCT CGCGTGGGGC
GCTCGCGTTC GCGAGAGCGA GTACGACGGC TGGCGGACGA AACACGGCGA CTGTGCGTAC
TGGGTGGATC TAACCGAGAA CACGGTCGTC ACCGACGGCT CGGTCCCCGA GGTCGGGACG
TGGCACCTAC TCCCCGAGGA CTTCGATGCT GACCTCGAGC CCGAGGCCGT CGACCAGGAG
GCGATCGACG CCGCCCTCGA GCGGCGCCGC GAGTATCCCG TGAAGTCCGA CCTCTCACTC
GAGCCGACCA GCTTTGGCGA CCGGCAGCGC GACCTCTCCT TACAGCAACA GGACCGAGAC
ACGACGCTCT CGGAGGTGGT CGAACAGCAC AACATCCCCG AGGCCGTCGC GCCGTGGCTC
TCGAAAGAAG CGCCGACGCG AGAGCAACTG GTTGCGGCCT GTCGGGCGAT CGACGAGCAC
GACGACCTCG CACGACAGGC GGATATCGCC GAGCATATCG ACTGGTCTCG TAGCAACCTC
GCGACGCATC TGAGCAAGAG CGAGAGTCTG AAGAAATGTG TCACGAAAAG CGGCGGCACC
TACGAGTTGA CGCCGATCGG GAAGCGAGCG GCAGAGGTCA AATGGAAAGT CGTGATGGAG
GAACTGAAAT AG
 
Protein sequence
MSSQHHLHQP SDNTETRVAV RNKLSVRIER QRRKETLNTE AHVRELIRDA EGAIVGDDET 
DRRAEELLEE LHLYLWHDPE RKAPTDGDTL PADARYRFKQ IHRQFQAGHD IESRTWFETL
VSVAKGLNYV EAFTDLTQYA PVRLETLNEQ GRQGNVETAT PIGRRRIGAD DAADLEDRAV
EITHSSCDHI LAVALPRSGK DSTITSIGMN LWKEHNYSYF SILDDGRMET PMVSIPNDED
AIQRNLERMG QEPEAFDAEV FVPAMDGIPS HLPANFKPFT IGIDDLTPHL ILRLAGITKS
DANTEQRIKQ ALDKTLERTG EVTELVSRLQ VYAKEMDATI EWVEKGDDSV ESKSVQYHME
AEDALNKAAQ RLAHLAGEGL VASPSAETNI DMASVIANQE QAAVLCCNFL TQGQEALKYT
IMDLWLRLIY QARDGDMRLP RVCLEIRELK NIAPSKLADV RYKETIKTLR QTIFFISTQG
GSRRILMLGS TQKLNDVYKP VRSNMATKIL LRLGEEEIET LDRSYNFSNE QMRQLSEFDI
GQGMILAGGE AYWPIELRGA PCGLGLGDRH WLDRYGLAWG ARVRESEYDG WRTKHGDCAY
WVDLTENTVV TDGSVPEVGT WHLLPEDFDA DLEPEAVDQE AIDAALERRR EYPVKSDLSL
EPTSFGDRQR DLSLQQQDRD TTLSEVVEQH NIPEAVAPWL SKEAPTREQL VAACRAIDEH
DDLARQADIA EHIDWSRSNL ATHLSKSESL KKCVTKSGGT YELTPIGKRA AEVKWKVVME
ELK
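
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the sequence is read in frame from its first base, ignores the spaces that group the sequence into blocks of ten, and uses the codon assignments of translation table 11, which for internal codons are the same as the standard genetic code. The special handling of alternative start codons is not modelled, since this gene begins with ATG. The helper names are illustrative and not part of the record.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering
# (third base varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGAGTTCAC AACATCACCT TCACCAACCG"))
```

Applied to the first 30 bases of the gene sequence, this gives the first ten residues of the protein sequence (MSSQHHLHQP); the final four codons, GAA CTG AAA TAG, give ELK followed by the stop.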