Gene Information |
Locus tag | Htur_5098 |
Symbol | |
ID | 8745903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013748 |
Strand | - |
Start bp | 62188 |
End bp | 65331 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646515711 |
Product | hypothetical protein |
Protein accession | YP_003406658 |
Protein GI | 284176382 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
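The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this explicitly) and that the protein length excludes the terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information table.
start_bp = 62188
end_bp = 65331
gene_length_bp = 3144
protein_length_aa = 1047


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions both checks agree with the table: the span covers 3144 bp, which is 1048 codons, or 1047 amino acids plus one stop codon.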
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.178666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGGCT TCGACATCCT TCCGTGCAAG ACGTGTTCGA CCACTTGGCT GCAAGAATCC GGGTGGCGAG CACAGGATAC CGTCTCGTGT CCGCATTGCG GGGCCGAACG CAGTACGGAT CTCGTTAAGA TCCGTGGCTC ACAGGAGACC AAGGCCGGCG CCGCCGAGCT TCGGTCTCGG ATCGAAGCCG CCGAAGCCGG CGAGTCAGAG GTGTACGACC AGTTCATCGA GGACCGTGGC CAGTACGCCG ACCAACTCGC CGAAGTCGAA GGCCAGATTG ACTCGTTCAC GCTCGACGCC GAGCAGATGG ATCTGACGCC GATCGAATCC GATCGGTTCG AACCGCTCGC CGAAACGATC CTCCGACCGG AACGGAAGAA GTTCAAGGAG TGGGCCGACG AGTACAGCGG CATCGGCGAG GACCGATTCG CCGACCTGGT ATCGTTCGAA CGACCCGGCG AGGGCTTGTT CGATGCCGAG ATTCGCGACG ACCTTGAGCA GTGGGCCGGC GACGTCGATC GGGACGAGCT GGCCCGCGGC GACGTAACGG TAACTGACCA GCAACCGGCC GCGGCCGCGA CCATCATCGA TCTGGACGCG ACGACGGTCT CCGAGGTATG GGCGGCGCTG TTCGACAGTG AGTCCGTCCG GGCCGCGTTC GCCGAGTCCA TCGTCGAACT GTTCGGTGGT CTCACACCCT TGGAGTGTTA CGACGTGCTC GAGGAGTACG GCGTCCCGTT CTGGGTTCGA TCCCACATCG TTGACGTTGC GCGTGGGTAC GCAGGCGACG CTGTGGCGAT CGACGGTGCC GCCGACGCCC GGCGCGTCGT CGACGAGATT ATCGCGCCGC TCCCGAACGC GTCCCTCGCT GGTACCGATG ACCTACTCGC CATCGCGAAT CTGTTCGACG GCCTCGAGAC CGAGCCGACA CTCGGCGTCG TTGTTCGGGA GTCGTTCATC GAGGATGTCC GGCGCGACCA GCGGATCGAT ATCTGTGACC TGCTGGCCGT GCTGGCCGCC GGCTGTGATG TTCGACTCGT TGGGTCGACC GTGACGCTCG CGAAGGTCGC GAACAGTCAC CGAGCGACCC TCCCCGGCGT TAGTGAGTGG TGCAATCGTC ACCGTGAAGA TACGCAGATC GACGACACTC AACAGCGTGT GGCCGACGAC CTCGAGCGCG GTGACTTTGC GGTCACAATG CTCCGCGAAC TGGACCGTGA ACCAACCGGG ATTTTCACGT ACTCCGAACT GTACGCGCTG TATCCCGGCG ATGATGACTC TCGTGTTCGC CAACTTGTCG GCGAGTTCCA CGACGCCGAT CTCGTCGAAC GCTTCGGTCC GCGAACCGAT CGGAAGGTCG AACTGCTCCC GGCTGGCCGA CGCGTTCTCG AGTTCTTCGA ACAGCAAATC GCACAACAGC GGTCGATTTC CGACTTCGTT AGCGGCGCCG GTAAACAACA ACAACAGGGC CGTGTACACA CCCAGACGGG AGGGGGTGGG GAGGACGGGG CCGGCGAAGA CAGCACCGAC GGCACCCGCC ACTACAGCAC CCGCTACATG AGCCCGGCCG AACACGCCGC CACCGCGGCG TGCGGACAGA ACGGCGGTGT GACGCTTGTG CGCGGGGGGA TAGAGGACCA CGCCGACCGG ACCCGGTACG CGAGTTACGA CCCGAAACGG GGGGAGGCCG TTGTGGCCGT ACAGGCCGCT GGACCCATGC AGATGACGGT GAGCGCTGCA TTAGCGTTAG CCAGCCCCGA GTTTGTGGAC CGAACACTCC CGGCAGACCG ACTCGAGTCC 
ATTGAGGACC CACCGGCGAT CGTGCGTGAC GCCCGCTGTA TCGGTGGGGC ATCCCAACAG GCTCTCGAGA ACGGCCAGCA GTTCCGCAAG GCGCTGGTCG AGTGGGGCAA GGATCTCTCG GAAATGACGA CCAAGCTCAA GGCCGGCAAC CTCTCGACGG ACCGCGCGGC CTTCTGTGGC GAGATCATCC GTTCGGCACA GGGTCTCTGG GGGACGCTCA CGCACCTGCT GGATCTGTTC GATATCGATG TCCACCGGGA GATTCGTATC CCGTCGGGCC TCTCGAGTGA CAATCTCGAG GACCTCGCGA AGTCGATCAG TTACGCGGCG GCTATCCAGT CGACGTACAA CGGCCACTTC GCGTGCTACC GGCAACTGTT CGAAGATCGG GACGACAAGC GCCGGGCGTC GTTCACCGCA CAGGTCGACG CGGCGGCGCC GACGGGCTCG CTCATCGGCT CGTTCGTTCT CCGTGGCCCG GACGTCCACC GACTCGAGGA ACCGCTTCAG ACGCGCCTCG AGTCGCCGCG TGACGTCCAC GACGACGCCC CGGAGTTCGG CGTCGATATC ACGGTCCGAA CCGACCTCGA GCGCACGGAC TACGACGAAG CCGTTCGCCG TGTCCTCTCC CGTAAGCGAC TCCGCACGAC GACCGCCGCC GTCTCGGTCC TGTACGCGCT CGTCGCGACG CCACACGACG CGGCTCGCGT GCTCCACCGC CAACTCGCCG CCGAAGACGA GTCCCGAGAG ATCCGGCCGG ATGAACTCCG AACCGCGCTT CGCGAGCTGG ACCCGACGGC ACTGCTCCCG ACGATCGGCA ACGAGCGCCG GACCAACTCG GCGGGCAAGA TCGTCGCGGC GCTGCTGGCC GCCGACGAAC CACTCTCGAA GGCCGACCTC GCCGACCGGG CCGGCGTCAC GAAAAAGACG GTCTACAACT ACCGCGAGAA ACTCGAAACG CTCGGCCTCC TGGTCGTCAC CGACGAGGGC TACCGGCTTG CACTGTCGTT CCCGACGACC GAGGAACGCA AACAGCCCGT ACTCCCGGCG TTCGTCGACC GGACGTTCAC CGAGGCCGCC GACGCGCTGC TCGTCGAGTC ACTCCCGCCG AGTCGCTACG GCGACCCGGA GGACTCACTC GGCGGCCTGT TGTTCTGGAC CGACGACAAC CCACCGAACC CGTGGGCGCT GCTCGAGCAC GACGACTACG CTCCGTGGGC GGAACTGGCC CGGAGACTCA CCGACGGCGA CCGGACGCGA CCGGCGGAAC TCCGTGTGTT AATGGGGCCG GAAATTAAGC AACAGTCGAT CGACGCGGCC ACCTCGAGCG CGGCGGCCGA CTAA
|
Protein sequence | MRGFDILPCK TCSTTWLQES GWRAQDTVSC PHCGAERSTD LVKIRGSQET KAGAAELRSR IEAAEAGESE VYDQFIEDRG QYADQLAEVE GQIDSFTLDA EQMDLTPIES DRFEPLAETI LRPERKKFKE WADEYSGIGE DRFADLVSFE RPGEGLFDAE IRDDLEQWAG DVDRDELARG DVTVTDQQPA AAATIIDLDA TTVSEVWAAL FDSESVRAAF AESIVELFGG LTPLECYDVL EEYGVPFWVR SHIVDVARGY AGDAVAIDGA ADARRVVDEI IAPLPNASLA GTDDLLAIAN LFDGLETEPT LGVVVRESFI EDVRRDQRID ICDLLAVLAA GCDVRLVGST VTLAKVANSH RATLPGVSEW CNRHREDTQI DDTQQRVADD LERGDFAVTM LRELDREPTG IFTYSELYAL YPGDDDSRVR QLVGEFHDAD LVERFGPRTD RKVELLPAGR RVLEFFEQQI AQQRSISDFV SGAGKQQQQG RVHTQTGGGG EDGAGEDSTD GTRHYSTRYM SPAEHAATAA CGQNGGVTLV RGGIEDHADR TRYASYDPKR GEAVVAVQAA GPMQMTVSAA LALASPEFVD RTLPADRLES IEDPPAIVRD ARCIGGASQQ ALENGQQFRK ALVEWGKDLS EMTTKLKAGN LSTDRAAFCG EIIRSAQGLW GTLTHLLDLF DIDVHREIRI PSGLSSDNLE DLAKSISYAA AIQSTYNGHF ACYRQLFEDR DDKRRASFTA QVDAAAPTGS LIGSFVLRGP DVHRLEEPLQ TRLESPRDVH DDAPEFGVDI TVRTDLERTD YDEAVRRVLS RKRLRTTTAA VSVLYALVAT PHDAARVLHR QLAAEDESRE IRPDELRTAL RELDPTALLP TIGNERRTNS AGKIVAALLA ADEPLSKADL ADRAGVTKKT VYNYREKLET LGLLVVTDEG YRLALSFPTT EERKQPVLPA FVDRTFTEAA DALLVESLPP SRYGDPEDSL GGLLFWTDDN PPNPWALLEH DDYAPWAELA RRLTDGDRTR PAELRVLMGP EIKQQSIDAA TSSAAAD
|
| |
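The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 30 bases of the gene. This is a sketch only: it builds the codon table from the compact NCBI-style amino acid string, whose codon-to-amino-acid assignments are the same for translation table 11 as for the standard code, and it ignores the alternative start codons that table 11 permits. The names `CODON_TABLE`, `translate`, and `gene_prefix` are hypothetical and introduced for this example.

```python
import itertools

# Codons enumerated in T, C, A, G order, paired with their amino acids.
# These assignments apply to translation table 11 (and the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence 5' to 3', stopping at the first stop codon."""
    # Drop the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence field.
gene_prefix = "ATGAGAGGCT TCGACATCCT TCCGTGCAAG"
print(translate(gene_prefix))  # MRGFDILPCK
```

The result matches the first block of the Protein sequence field. Because the Gene sequence field is already given in the coding orientation (it begins with ATG), no reverse complement is needed here even though the gene lies on the minus strand.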