Gene Htur_5098 details

Gene Information

Locus tag: Htur_5098
Symbol:
ID: 8745903
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013748
Strand:
Start bp: 62188
End bp: 65331
Gene Length: 3144 bp
Protein Length: 1047 aa
Translation table: 11
GC content: 65%
IMG OID: 646515711
Product: hypothetical protein
Protein accession: YP_003406658
Protein GI: 284176382
COG category:
COG ID:
TIGRFAM ID:
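
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below shows one way to check them, assuming 1-based inclusive coordinates and a gene length that includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that encodes no residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values taken from the Gene Information fields above.
length = gene_length_bp(62188, 65331)
print(length, protein_length_aa(length))
```

With the values listed for Htur_5098 this gives 3144 bp and 1047 aa, which agrees with the Gene Length and Protein Length fields.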


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.178666
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGGCT TCGACATCCT TCCGTGCAAG ACGTGTTCGA CCACTTGGCT GCAAGAATCC 
GGGTGGCGAG CACAGGATAC CGTCTCGTGT CCGCATTGCG GGGCCGAACG CAGTACGGAT
CTCGTTAAGA TCCGTGGCTC ACAGGAGACC AAGGCCGGCG CCGCCGAGCT TCGGTCTCGG
ATCGAAGCCG CCGAAGCCGG CGAGTCAGAG GTGTACGACC AGTTCATCGA GGACCGTGGC
CAGTACGCCG ACCAACTCGC CGAAGTCGAA GGCCAGATTG ACTCGTTCAC GCTCGACGCC
GAGCAGATGG ATCTGACGCC GATCGAATCC GATCGGTTCG AACCGCTCGC CGAAACGATC
CTCCGACCGG AACGGAAGAA GTTCAAGGAG TGGGCCGACG AGTACAGCGG CATCGGCGAG
GACCGATTCG CCGACCTGGT ATCGTTCGAA CGACCCGGCG AGGGCTTGTT CGATGCCGAG
ATTCGCGACG ACCTTGAGCA GTGGGCCGGC GACGTCGATC GGGACGAGCT GGCCCGCGGC
GACGTAACGG TAACTGACCA GCAACCGGCC GCGGCCGCGA CCATCATCGA TCTGGACGCG
ACGACGGTCT CCGAGGTATG GGCGGCGCTG TTCGACAGTG AGTCCGTCCG GGCCGCGTTC
GCCGAGTCCA TCGTCGAACT GTTCGGTGGT CTCACACCCT TGGAGTGTTA CGACGTGCTC
GAGGAGTACG GCGTCCCGTT CTGGGTTCGA TCCCACATCG TTGACGTTGC GCGTGGGTAC
GCAGGCGACG CTGTGGCGAT CGACGGTGCC GCCGACGCCC GGCGCGTCGT CGACGAGATT
ATCGCGCCGC TCCCGAACGC GTCCCTCGCT GGTACCGATG ACCTACTCGC CATCGCGAAT
CTGTTCGACG GCCTCGAGAC CGAGCCGACA CTCGGCGTCG TTGTTCGGGA GTCGTTCATC
GAGGATGTCC GGCGCGACCA GCGGATCGAT ATCTGTGACC TGCTGGCCGT GCTGGCCGCC
GGCTGTGATG TTCGACTCGT TGGGTCGACC GTGACGCTCG CGAAGGTCGC GAACAGTCAC
CGAGCGACCC TCCCCGGCGT TAGTGAGTGG TGCAATCGTC ACCGTGAAGA TACGCAGATC
GACGACACTC AACAGCGTGT GGCCGACGAC CTCGAGCGCG GTGACTTTGC GGTCACAATG
CTCCGCGAAC TGGACCGTGA ACCAACCGGG ATTTTCACGT ACTCCGAACT GTACGCGCTG
TATCCCGGCG ATGATGACTC TCGTGTTCGC CAACTTGTCG GCGAGTTCCA CGACGCCGAT
CTCGTCGAAC GCTTCGGTCC GCGAACCGAT CGGAAGGTCG AACTGCTCCC GGCTGGCCGA
CGCGTTCTCG AGTTCTTCGA ACAGCAAATC GCACAACAGC GGTCGATTTC CGACTTCGTT
AGCGGCGCCG GTAAACAACA ACAACAGGGC CGTGTACACA CCCAGACGGG AGGGGGTGGG
GAGGACGGGG CCGGCGAAGA CAGCACCGAC GGCACCCGCC ACTACAGCAC CCGCTACATG
AGCCCGGCCG AACACGCCGC CACCGCGGCG TGCGGACAGA ACGGCGGTGT GACGCTTGTG
CGCGGGGGGA TAGAGGACCA CGCCGACCGG ACCCGGTACG CGAGTTACGA CCCGAAACGG
GGGGAGGCCG TTGTGGCCGT ACAGGCCGCT GGACCCATGC AGATGACGGT GAGCGCTGCA
TTAGCGTTAG CCAGCCCCGA GTTTGTGGAC CGAACACTCC CGGCAGACCG ACTCGAGTCC
ATTGAGGACC CACCGGCGAT CGTGCGTGAC GCCCGCTGTA TCGGTGGGGC ATCCCAACAG
GCTCTCGAGA ACGGCCAGCA GTTCCGCAAG GCGCTGGTCG AGTGGGGCAA GGATCTCTCG
GAAATGACGA CCAAGCTCAA GGCCGGCAAC CTCTCGACGG ACCGCGCGGC CTTCTGTGGC
GAGATCATCC GTTCGGCACA GGGTCTCTGG GGGACGCTCA CGCACCTGCT GGATCTGTTC
GATATCGATG TCCACCGGGA GATTCGTATC CCGTCGGGCC TCTCGAGTGA CAATCTCGAG
GACCTCGCGA AGTCGATCAG TTACGCGGCG GCTATCCAGT CGACGTACAA CGGCCACTTC
GCGTGCTACC GGCAACTGTT CGAAGATCGG GACGACAAGC GCCGGGCGTC GTTCACCGCA
CAGGTCGACG CGGCGGCGCC GACGGGCTCG CTCATCGGCT CGTTCGTTCT CCGTGGCCCG
GACGTCCACC GACTCGAGGA ACCGCTTCAG ACGCGCCTCG AGTCGCCGCG TGACGTCCAC
GACGACGCCC CGGAGTTCGG CGTCGATATC ACGGTCCGAA CCGACCTCGA GCGCACGGAC
TACGACGAAG CCGTTCGCCG TGTCCTCTCC CGTAAGCGAC TCCGCACGAC GACCGCCGCC
GTCTCGGTCC TGTACGCGCT CGTCGCGACG CCACACGACG CGGCTCGCGT GCTCCACCGC
CAACTCGCCG CCGAAGACGA GTCCCGAGAG ATCCGGCCGG ATGAACTCCG AACCGCGCTT
CGCGAGCTGG ACCCGACGGC ACTGCTCCCG ACGATCGGCA ACGAGCGCCG GACCAACTCG
GCGGGCAAGA TCGTCGCGGC GCTGCTGGCC GCCGACGAAC CACTCTCGAA GGCCGACCTC
GCCGACCGGG CCGGCGTCAC GAAAAAGACG GTCTACAACT ACCGCGAGAA ACTCGAAACG
CTCGGCCTCC TGGTCGTCAC CGACGAGGGC TACCGGCTTG CACTGTCGTT CCCGACGACC
GAGGAACGCA AACAGCCCGT ACTCCCGGCG TTCGTCGACC GGACGTTCAC CGAGGCCGCC
GACGCGCTGC TCGTCGAGTC ACTCCCGCCG AGTCGCTACG GCGACCCGGA GGACTCACTC
GGCGGCCTGT TGTTCTGGAC CGACGACAAC CCACCGAACC CGTGGGCGCT GCTCGAGCAC
GACGACTACG CTCCGTGGGC GGAACTGGCC CGGAGACTCA CCGACGGCGA CCGGACGCGA
CCGGCGGAAC TCCGTGTGTT AATGGGGCCG GAAATTAAGC AACAGTCGAT CGACGCGGCC
ACCTCGAGCG CGGCGGCCGA CTAA
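
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any statistic such as the GC content field can be recomputed. The following is a minimal sketch of that step; the helper names are hypothetical, and the rounding behind the reported GC content value is not specified in this record.

```python
def clean_sequence(raw):
    """Join a sequence printed in whitespace-separated blocks and lines."""
    return "".join(raw.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First printed line of the gene sequence, copied from the record.
first_line = "ATGAGAGGCT TCGACATCCT TCCGTGCAAG ACGTGTTCGA CCACTTGGCT GCAAGAATCC"
cleaned = clean_sequence(first_line)
print(len(cleaned), round(gc_percent(cleaned), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` would give a value to compare against the GC content field.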
 
Protein sequence
MRGFDILPCK TCSTTWLQES GWRAQDTVSC PHCGAERSTD LVKIRGSQET KAGAAELRSR 
IEAAEAGESE VYDQFIEDRG QYADQLAEVE GQIDSFTLDA EQMDLTPIES DRFEPLAETI
LRPERKKFKE WADEYSGIGE DRFADLVSFE RPGEGLFDAE IRDDLEQWAG DVDRDELARG
DVTVTDQQPA AAATIIDLDA TTVSEVWAAL FDSESVRAAF AESIVELFGG LTPLECYDVL
EEYGVPFWVR SHIVDVARGY AGDAVAIDGA ADARRVVDEI IAPLPNASLA GTDDLLAIAN
LFDGLETEPT LGVVVRESFI EDVRRDQRID ICDLLAVLAA GCDVRLVGST VTLAKVANSH
RATLPGVSEW CNRHREDTQI DDTQQRVADD LERGDFAVTM LRELDREPTG IFTYSELYAL
YPGDDDSRVR QLVGEFHDAD LVERFGPRTD RKVELLPAGR RVLEFFEQQI AQQRSISDFV
SGAGKQQQQG RVHTQTGGGG EDGAGEDSTD GTRHYSTRYM SPAEHAATAA CGQNGGVTLV
RGGIEDHADR TRYASYDPKR GEAVVAVQAA GPMQMTVSAA LALASPEFVD RTLPADRLES
IEDPPAIVRD ARCIGGASQQ ALENGQQFRK ALVEWGKDLS EMTTKLKAGN LSTDRAAFCG
EIIRSAQGLW GTLTHLLDLF DIDVHREIRI PSGLSSDNLE DLAKSISYAA AIQSTYNGHF
ACYRQLFEDR DDKRRASFTA QVDAAAPTGS LIGSFVLRGP DVHRLEEPLQ TRLESPRDVH
DDAPEFGVDI TVRTDLERTD YDEAVRRVLS RKRLRTTTAA VSVLYALVAT PHDAARVLHR
QLAAEDESRE IRPDELRTAL RELDPTALLP TIGNERRTNS AGKIVAALLA ADEPLSKADL
ADRAGVTKKT VYNYREKLET LGLLVVTDEG YRLALSFPTT EERKQPVLPA FVDRTFTEAA
DALLVESLPP SRYGDPEDSL GGLLFWTDDN PPNPWALLEH DDYAPWAELA RRLTDGDRTR
PAELRVLMGP EIKQQSIDAA TSSAAAD
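
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is an illustration under stated assumptions, not the method used to produce this record: it builds the codon-to-residue assignments in the usual TCAG ordering, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, which this sketch does not handle). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first, so block-formatted sequences can be pasted in.
    Alternative start codons are not converted to M in this sketch.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First printed line of the gene sequence, copied from the record.
first_line = "ATGAGAGGCT TCGACATCCT TCCGTGCAAG ACGTGTTCGA CCACTTGGCT GCAAGAATCC"
print(translate(first_line))
```

The first 60 bases translate to MRGFDILPCKTCSTTWLQES, matching the first two blocks of the protein sequence above.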