Gene Htur_1460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_1460 
Symbol 
ID8742051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp1520672 
End bp1524439 
Gene Length3768 bp 
Protein Length1255 aa 
Translation table11 
GC content68% 
IMG OID646512036 
ProductProtein of unknown function DUF2223 
Protein accessionYP_003403019 
Protein GI284164740 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4945] Membrane-anchored protein predicted to be involved in regulation of amylopullulanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAGCCG CGGCAGCGTA CACGCGAGCC GTCCCCGAAC GCGCGGCGGC GCGAGTCTCG 
AACGGCGACG ACGACGCCGT CCAGCGCGTC AGTTCGCCGG ACGGCACCGT CGAGATGACC
GTCGACGTCT CCGACGGGAC CCCCCGCTAC GCGGTGGCGG TCGACGGCAC AACCTATCTC
GAGCCGTCGA CCGTCGGATT CGACTTCCGG AACCAGCCGT CGTTCGGAGC GAGCGCGGAC
GGCACGACCG GTTCCGAGGT CGCGGTCACC GGAAGCGAGC GCGCGTCGGA AACGGAGGTC
TGGGAGCCCG TCTGGGGCTC GTACGACCGC GTGAGCGCCG AGTACAACGC GCTGATCATC
GGACTCGAGG AGACGGCCGA ACCGGGCCGG TCGGCGAACC TCGAGGTCCG CGTCTTCGAC
GACGGGGTCG GTCTCCGAGT CGTGTTCGGC GACAGTTTCG CGAGCAACAG CGGACGGGCC
GTCGTCACGT CGGAGAACAC CGCGTTCGCG TTCGCCGACG ATGACACCGC CTGGTGGATC
CGAAACGAAG TCACCAACCC CCGGTTCGAA CAGGAGTACG AGGAGACGCC GCTCAGCGAG
ATTCCCGGCA GCTCGCGGGA GACTCGACCC ACGGGGACGC CGATGCGAAA CGGCGCGCAC
ACGCCGCTGA CGGTCGAGGC CGGCGACGGC GACGTCTACC TGAGCGTCCA CGAGGCGGAC
CTCGAGGACT ACGCGGCCGC GACGCTCGCG CCGCGATCGG ACGAGGGCGG AACGGAGCTC
GTCACGGAAC TCACGCCGCT ACCGGACGGG ACGAAGGCGT CGCTTTCGCT CCCGAACGCG
ACGCCGTGGC GGACGATTCA GATCGGCCGC CGACCGGGCG ACCTGATCGA GTCGGAGCTG
ATTCCGCTGT TGAGCGCCGA ACTCGAGGAG TCGGCCATGC CGACCGTCGA CGGCGAGCCG
GACACCGACT GGATCGAGCC GCGAAAGTAC GTCGGCATCT GGTGGACGAT GATCGCGGGC
TCCGCCAACT GGGAGTACCG GCCGGACGAC TCGTTCGACA GTCCCGAGGA CGCGGCGGGA
TACGTCCACG GCGCCCGGAC CGAGCGGATG AAGCGGTACA TGAACTTCGC GGCGGAGAAC
GGCATCGACA GCGTCCTCGT CGAAGGGTGG AACGAGGGCT GGGACACCTA CCCCGGCGAC
GGCACGGGCC TCGAGTTCGG CGTTGACGAC TCCTATCCGG ACTTCGACGT CCGCGAGGTG
ACCGACTTCG GGCGCTCGCT CGAGTCCGGC GTGGAGATGA CGATCCACAA CGAGACCGCG
GGCGCGCTGC CCCACTACGA GGACCAGTTC CTGAACGACG ACATCTTCCG GCAGTACGAG
GACGTCGGCA TTCACTCGAT CAAGAACGGC TACGTCTCCG ACGAGGGGCT GGGCATCGAG
GGCGACGGCG CCGAGCCGAC GCACAACCAG CACAATCAGC TGGCGGTCAA CCACCACCGG
CTGGTCATCG AAGCCGCCGC GGCCAATCGC CAGCTGCTCG AGATCCACGA GGGGATCAAG
CCGACCGGAG AGATCCGGAC CTACCCCAAC GTGGCCAACC GCGAGGTCGT GAAAGCCCAG
GAGTACGACG GCTTCGAGCA GTTGAGCGCG GACGTCGCTC CGGATCACCA CGTCACGCTG
CCGTTCACCC GGAACCTGGC CGGTCCGGTC AGCTACCAGC CGGGCATCTT CGACATCACG
TTCAACGACG ATCGAGGCGG ACAGATCCAG ACGACGCGGG CGAAGCAACT CGCCCTCTAC
CCGAACTACC TGAGCGGGCT GCAGATGGTC GCCGACCGCG TCGAGGCCTA CGTCGACGAG
ACCCTCGCGG TCGGTGAGTG TCTGCAGGCC GCCTCGGGCG ACATCGACGG CTTCGTCACG
CTGGACGAGT GGCGAAACGC CTTCGGCACG AACTACGTCG CGGTCGACCC CAATCGCGTT
CCGTCGGGAT CGTCGGTCTC CTTCATCGTC GAAGACGCCG ACGCGGGGAC ACACGAACTC
CATCTCCGAT ACGCGGCGGC GCCCGAAGAC AACGCCCAGC GAGTCGTCGA GGCCGGCGAA
ACTCAGGCGA CGCTGCGCGT CAACGGCGAG ACGACGACCA TCAACCCAGA CTTCACAGAC
TACTGGGACC AGTGGGAGGT CTTCACGACG GAGATCGACC TCGAGGACGG CGACAACGAG
GTCGCGATCG AACTGCAGTA CGACGACGGC GAGGAATTCG AGGGCGACGT CGGCGGCTTC
AACCTCAACA CGATCGGGAT CACCGAACCC GGCGATCGGT CCCCGATGCC GGCCGAGTAC
GAGGGCTACA CGCCCGAGAA CGAGAACTTC GACGCGAAAC CCGCCTTCGA GTTCATCGAG
TCGGTTCCCG CGGCCGGCTG GGACGAGACG AACGTCGTCG ATAGCGAGAT CGGCGACTAC
GTCGTCACCG CCCGCCGAAA GGGCGAAGAG TGGTACGTCG GCGCGATGAC CGACGAGGGC
GGCCGCGCGG TCGACGTGCC GCTCGAGTTC CTCGCTCCGG GGAATTCGGG CTGTCAGGGG
CGCGGCATGG GATCGAAAGG GCACGGTAAT GGCCCCAAGG AACCCAAGTA CGTCGCCGAG
ATCTACTCGG ACGGCCTCGG CGGCGGCTAC GAGTCCGATC CCGAGGCCGT CAGGATCGAC
GAAGCCGTCG TCGATCCGAG CGCGACGGTC CTCGCCTCGA TGGCCCGCAG CGGCGGGACC
GCGATCCGGC TCCGCCCCGC GACGGGGACG GAGCGCAAGA GACTCCCGAC CTACGAGCGC
CCTACCCAGG ACGTGCGCTA CGAGATCGAC GATCAGGCGG GACTCGGCGA GCCGTTCATC
GCGGCCACCG GCTCGAACGA CGGCGATTTC GTCGGCGGGA CGACGGTCGC GATCGAGATC
GACGGCGAGC GCGAGACCGT CGACAACGTC CGGCTCTCGC CCGGAGCGAC CGACGAGACC
GTCGAGCTCG GCTACGCGAT CACGTCGATC GGCACGTACG ACGTCGTCCT GCGCGACCCG
GACGATGGGA CCGTCCTCGC GTCCGAAACG GTCACCGTCG CCCCCGGCGA CCTCGTCGCC
GAGTTCGACG ATCCGGCCGG CGACGACCAC GGTCCCGGCG AGTACACCTA CCCGACCAGC
GACGACTTCC GGGACGGGGC GTTCGACCTG CGCTCGTTCG CCGTCTACGA GGCCGACGAC
GCGTATCGGT TCGTCTTCGA AGTAGAGGAG CTCTACGACA CCTTCGGCGG GGAGTTCTCG
CCCCACTACT TCGCGGTCTA TCTCCGGGAT CCGTCCCGGG ACGGCGGCCG AACGACGGAA
CTCGGCGACC TCGAGGTCAC CGCCGCCTTC GAAGAGCCCT GGCACTACCG CGTCGCCGCC
AGCGGCTTCG GCTCGAGCGT CGTCGACGCG GCCGGTACCA GCCTCGGATC GCCGACGACC
GTCGTCGACT TCGAGAGCGA CACGGCCATC CTCTCCGTCG AGAAAGGCAC GCTCAGCGTG
GACATCGCGA ACGCCGAGGT CGTCCCCGTG GTCGGCTCCG AGGACCGCGG GACGTTCCGT
GCCGTCGACG TCGAGGCCGA GGGCTACGTT TTCGGCGGCG CACGCGAAGA CGCGATCGAG
AACGCCCCGC GGATCATCGA CCACCTGACG CCACCGGGCG TCGATCAGTC CGACGCGCTC
GCGTACGACG CGGACTCGCT GGCGACGCTC CCGTTCGTCC CGCTGTGA
 
Protein sequence
MLAAAAYTRA VPERAAARVS NGDDDAVQRV SSPDGTVEMT VDVSDGTPRY AVAVDGTTYL 
EPSTVGFDFR NQPSFGASAD GTTGSEVAVT GSERASETEV WEPVWGSYDR VSAEYNALII
GLEETAEPGR SANLEVRVFD DGVGLRVVFG DSFASNSGRA VVTSENTAFA FADDDTAWWI
RNEVTNPRFE QEYEETPLSE IPGSSRETRP TGTPMRNGAH TPLTVEAGDG DVYLSVHEAD
LEDYAAATLA PRSDEGGTEL VTELTPLPDG TKASLSLPNA TPWRTIQIGR RPGDLIESEL
IPLLSAELEE SAMPTVDGEP DTDWIEPRKY VGIWWTMIAG SANWEYRPDD SFDSPEDAAG
YVHGARTERM KRYMNFAAEN GIDSVLVEGW NEGWDTYPGD GTGLEFGVDD SYPDFDVREV
TDFGRSLESG VEMTIHNETA GALPHYEDQF LNDDIFRQYE DVGIHSIKNG YVSDEGLGIE
GDGAEPTHNQ HNQLAVNHHR LVIEAAAANR QLLEIHEGIK PTGEIRTYPN VANREVVKAQ
EYDGFEQLSA DVAPDHHVTL PFTRNLAGPV SYQPGIFDIT FNDDRGGQIQ TTRAKQLALY
PNYLSGLQMV ADRVEAYVDE TLAVGECLQA ASGDIDGFVT LDEWRNAFGT NYVAVDPNRV
PSGSSVSFIV EDADAGTHEL HLRYAAAPED NAQRVVEAGE TQATLRVNGE TTTINPDFTD
YWDQWEVFTT EIDLEDGDNE VAIELQYDDG EEFEGDVGGF NLNTIGITEP GDRSPMPAEY
EGYTPENENF DAKPAFEFIE SVPAAGWDET NVVDSEIGDY VVTARRKGEE WYVGAMTDEG
GRAVDVPLEF LAPGNSGCQG RGMGSKGHGN GPKEPKYVAE IYSDGLGGGY ESDPEAVRID
EAVVDPSATV LASMARSGGT AIRLRPATGT ERKRLPTYER PTQDVRYEID DQAGLGEPFI
AATGSNDGDF VGGTTVAIEI DGERETVDNV RLSPGATDET VELGYAITSI GTYDVVLRDP
DDGTVLASET VTVAPGDLVA EFDDPAGDDH GPGEYTYPTS DDFRDGAFDL RSFAVYEADD
AYRFVFEVEE LYDTFGGEFS PHYFAVYLRD PSRDGGRTTE LGDLEVTAAF EEPWHYRVAA
SGFGSSVVDA AGTSLGSPTT VVDFESDTAI LSVEKGTLSV DIANAEVVPV VGSEDRGTFR
AVDVEAEGYV FGGAREDAIE NAPRIIDHLT PPGVDQSDAL AYDADSLATL PFVPL