Gene Information |
Locus tag | Htur_0112 |
Symbol | |
ID | 8740675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 120566 |
End bp | 123271 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646510675 |
Product | TRAP transporter, 4TM/12TM fusion protein |
Protein accession | YP_003401686 |
Protein GI | 284163407 |
COG category | [R] General function prediction only |
COG ID | [COG4666] TRAP-type uncharacterized transport system, fused permease components |
TIGRFAM ID | [TIGR02123] TRAP transporter, 4TM/12TM fusion protein |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATAG ATACGAGCGG TACGGACACG GTTTCGGACG AACAGACAGA CGAGGTACTT GAGGAAATCG AGCGGCGGCG GACGCTTCGA GGACCCGCGG CCGTTCTCGT CGCCCTGATC GGAATCAGCT TCTCGGCGTT CCAGATGTGG ATCGCCGCCC GTGGGCGCCA GTTCGGTGGA ACGCTGCCCG TGATCGGTGA GTTCCAGATT ATCTCGCTAC AACAGTTACA GGTCAACGCG ATCCACGTCA CGTTCGCGCT GGTGCTCGCC TTTCTGCTGT TTCCGGCAAG CGAAGGCGAC GGGTTCGTCG CGCAGCAGCT CGGTCGGATC CCGCCAGCTG TCCGCGACCG CTTGGGCGCC GACCACGCCG TCTCGAGGGC GATCACTCGA CTCGGGGACG GCGTCCGCTG GGCCGTCGTC GATCCGTCAC GGGACCGAAT TACGCCGCTG GACGTCGCGA TGATCGCCCT CGCACTCTGG CCGGCCTACT ACATCACCAC CGAGTTCGAC GAGATCCGGT CGCTCCCGAT CATCGGCCTC GAAAACGCCA GTGCGATCCA CGAGCTGTAT CCGTGGCTCG AGCCGCTCGT GGTGCCGCTC GCGTCCATGG GACTTCCGGT CGACTTCCCC GTCGCGTACC TCCTCGGAAT CGTCGGCATC CTGCTGGTGC TCGAGGCGAC CAGACGAACG CTCGGCGTCG TCCTCATGGG GCTGGTCGCG TCGTTTATCG TCTACGCCCG CTGGGGCTAC ATGATCCCCA GCGACTCGCC GATCGGCGCG CTGGCGATTC AGGTCATCGA GTGGGACAAC ATCGTCTACA ACCTCTGGTA CACGGTCGAG GCGGGGGTGT TCAGCACGCC CGTCAGCGTC AGCGTCCGGT TTATCTACAT CTTCATCCTC TTCGGCGCGT TCCTCGAGAT GAGCGGTGCC GGCAAGTGGT TCATCGATCT CGCCTACTCG ATGACCGGCA CCAGAAAGGG CGGCCCGGCG AAGGCGAGCG TCGTCTCGAG CGGGTTCATG GGTATGCTCA GCGGCTCGTC GATCGCGAAC ACGGTGACGA CGGGCGCCTT TACGATTCCG CTGATGAAGC GGTCGGGCTA CTCGCCCGAG TTCTCCGGCG CGGTGGAGTC GTCGGCGTCG TCCGGCGGGC AGATCCTCCC GCCGGTCATG GGCGCCGTCG CGTTCCTCAT GGTCCAACTC ATCGGTGAGC CGTACTCGAA CATCATTATC GCGGCCACGA TCCCCGCGTT CGCGTTCTTC TTCGGCATGT GGGTGATGGT CCACTTCGAG GCCGTCAAAG GCGGAATCGG CGGCATCCCG CGTGCGGAAC TCCCCGACGT CTCGGCGGCG ATCCGCACCG GCTGGTTCTA CCTCATCCCG CTCGTTCTGC TGGTCTACTT CCTGGTCATC GCTCGGTTCT CGATCAACCG CGCGGGCTGG TACACGATCG TCACCATCAC CGCGCTGATC GCGGTCGTCG CCGCGTACAA CGAGCGGACG CGCCTCCCGC TGCTGGGTTC GATTGCCGCG CTCTACCTCG CCCAGGCCGC CGCCTTGGCG AGTTACGGCG TCGGTCTCGG CGACGCGATT CAGGTTGCAC TCGGCCTCGA GTCGGCGAGC GCGGCGTACT CGATTCGCGA CGCGGCGGTC GCCGCGGCCG CGGATCTCGG CCTCATCGCG ATCCTCGTCA GTCTCGCGGT CATGCTCGCT CGTCCGCGAG GTGACGCGCC GTTGCTCGAA CTCGACGAAG CGGTCGACGA CGCCGCGACC GCGACCGCGG CGTCGATCGA TCGCCCCGCG 
CTCGCTCGGA ACACCGGCTA CCGGTTCGGG GCGTTCATCC TGAAATCGAT GGACTCGGGC GCGCGTACCG CGACGACGGT CGTCGTCGCC GTCGCGGCCG CGGGCGTCGT CCCGGGAGTC ATCAGCGTCT CCGGGCTCGG CCCGAACCTC GCGGCGCTCA TCAACACCGT CAGCGGGGGC TCGATGCTGA TGCTGCTCGT TCTGACGGGA CTCGCGTCGA TCATCTTCGG GATGGGGATG CCCACGACGG CCATGTACAT CATCCTGATC GCGATGCTCG GCGGCCCGAT CGAGGACATG GGCGTTTGGC TGGTTGCGGC GCACCTCTTC GTCCTCTACT TCGGCCTGAT GGCTGACGTC ACGCCGCCGG TCGCCGTCGC CGCCTTCGCA GGCGCGGGGG TCGCCAAGGC CGACGAGCTC AAGACCGCGA GCATCGCGTT CCTCCTCTCG CTGAACAAGA TCCTCGTCCC CTTCGCCTTC GTGTTCTCGC CGGGGATCGT CCTCGCGCGG AAAGTCGACG GCGAGTGGGG GCTCATCGGC TGGAGCGACG TCGCCGATGT CGGCTTCTTC CTCCCCGAAG TCATCGTCCC CGTTATCGGG ATGTTCGTCG GGGTGTACGC GCTCGGCGTC ACCATCATCG GCTACCAGTA CTCGGCGGTC GACTCGACCC GGCGCGCCCT GTACGCCGTG GCGTCGATCC TGCTGATGGT TCCCGAGATC CCGCTTCTCG TCGTCGAAGG AGCGCTCGCA CTGGCCGGGC TGTCGATCGG ACTGACCGGC TTGTGGGTGA CCGTCTCTCT GCGGCTCCTC GGCCTCGCGA TCCTCGCCTC GCTGTCGTAT CGCAACTATT CTCGGCTGCC TGACGAACAG TCGGACCCCG CCGCGCCCAC AGCGGGCAAC GCCTAA
|
Protein sequence | MSIDTSGTDT VSDEQTDEVL EEIERRRTLR GPAAVLVALI GISFSAFQMW IAARGRQFGG TLPVIGEFQI ISLQQLQVNA IHVTFALVLA FLLFPASEGD GFVAQQLGRI PPAVRDRLGA DHAVSRAITR LGDGVRWAVV DPSRDRITPL DVAMIALALW PAYYITTEFD EIRSLPIIGL ENASAIHELY PWLEPLVVPL ASMGLPVDFP VAYLLGIVGI LLVLEATRRT LGVVLMGLVA SFIVYARWGY MIPSDSPIGA LAIQVIEWDN IVYNLWYTVE AGVFSTPVSV SVRFIYIFIL FGAFLEMSGA GKWFIDLAYS MTGTRKGGPA KASVVSSGFM GMLSGSSIAN TVTTGAFTIP LMKRSGYSPE FSGAVESSAS SGGQILPPVM GAVAFLMVQL IGEPYSNIII AATIPAFAFF FGMWVMVHFE AVKGGIGGIP RAELPDVSAA IRTGWFYLIP LVLLVYFLVI ARFSINRAGW YTIVTITALI AVVAAYNERT RLPLLGSIAA LYLAQAAALA SYGVGLGDAI QVALGLESAS AAYSIRDAAV AAAADLGLIA ILVSLAVMLA RPRGDAPLLE LDEAVDDAAT ATAASIDRPA LARNTGYRFG AFILKSMDSG ARTATTVVVA VAAAGVVPGV ISVSGLGPNL AALINTVSGG SMLMLLVLTG LASIIFGMGM PTTAMYIILI AMLGGPIEDM GVWLVAAHLF VLYFGLMADV TPPVAVAAFA GAGVAKADEL KTASIAFLLS LNKILVPFAF VFSPGIVLAR KVDGEWGLIG WSDVADVGFF LPEVIVPVIG MFVGVYALGV TIIGYQYSAV DSTRRALYAV ASILLMVPEI PLLVVEGALA LAGLSIGLTG LWVTVSLRLL GLAILASLSY RNYSRLPDEQ SDPAAPTAGN A
|
| |
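The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon (the record lists "Is gene spliced | No", and the gene sequence ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the Htur_0112 record (Start bp, End bp).
    length = gene_length(120566, 123271)
    print(length, protein_length(length))  # prints: 2706 901
```

Under these assumptions the computed values match the Gene Length (2706 bp) and Protein Length (901 aa) fields listed in the record.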