Gene Information |
Locus tag | Htur_2170 |
Symbol | |
ID | 8742772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 2244311 |
End bp | 2245909 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 646512752 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003403724 |
Protein GI | 284165445 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
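The length fields in this record can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(2244311, 2245909)
print(length_bp)                  # 1599
print(protein_length(length_bp))  # 532
```

Both values agree with the Gene Length and Protein Length fields listed for Htur_2170.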
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGTCCC CAGTGCTGTT CATCCCCGGT CTCTCAGTCC TCCTCGGGCT CTACGGCTAC CTGGTCAAGT CACCGCTGCC GCTCCCGGCG AGCGATGAAA CGGAAACGGA GCCGACGGTA GAGGTGTTCA TTCCGGCGTT GAACGAAGAG GAGACGATCG GATACGCGAT CGCGTCGCTG ACGTACCAGA CCGTGCGACC GGACGCGGTA ACGGTCGTCG ACGACGGCTC GACGGACGCG ACTGCCGCGG TCGTCGAAGC CGTGCGAGAG CAGATCGACA TCGAGATCAA TTTCGTCCGG CACACGGACC GAGGAAGCAA GACCAAGCGT CTCAAACAGG TCACCCGGAA CAGCGATGCG GACAAGATAT TCGTCCTCGA CGCGGACACC TATCTCGTCA GCGAGACCTA CCTCGAACGG GTAATCGCCG CGCAGGAAGC CGAGGACGTG GCCTGTTCGT TCGGGGTCGT ACAGCCCGAT ACGCGGTCGA CGAAGCGGTC GTTCCACCGC GATAGCCTGG AGCCGCTGTT CCCGAACGGC GTCCCAGCCG AGGCGGTTCC GGACTGGATC GAGCGCGATC AATTGGGTCG GGACCGACCG TCGTATCTCG TCACTCGATG GCCCGTCGAA CAGTACCGGA GCATACTGTA CGCGATCGAG CAGCGCTTCT TCAAGGAGGC CCAGATGCGG CTGATCAGGA CGTCGCTGTT TCCGGCGGGT TGCGGGGTAC TCTACGATCG TCAGGCCCTC CGATCGGTGT TCGACGACTT CGAGCCCTCG CTGGGCAACC AGTTGACGAA CAGCGAAGAC ATCTTCATCG GCTTCTCGCT CGTCGACCGC GGATTCGCGA ACGTGCAGGT CTCGGACGTG AGGATGCGAA CGGTCGAGCC GACCCTCGCC GCGATGGCAG ATCAGACGTA CCTATGGAGT TCGTCGTTCC TCCAGAGCAC GTTCTATTAC CGGGTGTTCT CCCGATGGCT CCGATCGCAA ACCGGCGACG TGACCGACGA GATCCCCCGC GACGAACCCG GCCTGATCGC GGAGGAGACG ACCACCGTCG GGACGGACGG AGGGTTCGCG AGTCCGACCG ACTCCGAACG CGTCGGGACC CCTCCCACGG GAGCGGGTTC GGACGAACCG ACCACCCCCT CGAGCGTCCG GGCCGATCGG AACGAGGAGT CGCCATCGGT CGGCGTTCCG ACGTCCGCCG ATAGCTCGGC GGAGTCCGCG CCGGACCGGG AGATGGCCGC TGACCGAGAC AGCGGCGACG AGGACGACTC GGGATGGTTC GATCGGCGAC GCACGCTCAG TGCGGTCGTC GGTTCGCAGA TCGTCGACGG ACTGTACCCG ATCGCACTCC TCGTCGTCGG ACTGCTCTCC GTCTTCGGGC TCTTCCCGAT CGAGCTACTG CTGGCGGTCG TCGCCGTCGA ATTCGGCCTC TACCTGCTCA TCGCGGGACT GTTCGCCCAT CGCCAGATAA CGCTCCTGTC GTTGCTCGTG TCCGTACCCG TTCGGCTATC TCAGTTACCG GTCGGCGTGT ACGTCTACGC GCGGGTCGCG ACGGATCTGC TCAGGGGAAA ACGAAACTGG AACAAGTGA
|
Protein sequence | MRSPVLFIPG LSVLLGLYGY LVKSPLPLPA SDETETEPTV EVFIPALNEE ETIGYAIASL TYQTVRPDAV TVVDDGSTDA TAAVVEAVRE QIDIEINFVR HTDRGSKTKR LKQVTRNSDA DKIFVLDADT YLVSETYLER VIAAQEAEDV ACSFGVVQPD TRSTKRSFHR DSLEPLFPNG VPAEAVPDWI ERDQLGRDRP SYLVTRWPVE QYRSILYAIE QRFFKEAQMR LIRTSLFPAG CGVLYDRQAL RSVFDDFEPS LGNQLTNSED IFIGFSLVDR GFANVQVSDV RMRTVEPTLA AMADQTYLWS SSFLQSTFYY RVFSRWLRSQ TGDVTDEIPR DEPGLIAEET TTVGTDGGFA SPTDSERVGT PPTGAGSDEP TTPSSVRADR NEESPSVGVP TSADSSAESA PDREMAADRD SGDEDDSGWF DRRRTLSAVV GSQIVDGLYP IALLVVGLLS VFGLFPIELL LAVVAVEFGL YLLIAGLFAH RQITLLSLLV SVPVRLSQLP VGVYVYARVA TDLLRGKRNW NK
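The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a display string might be joined back into a contiguous sequence and its GC fraction computed. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
def join_blocks(display_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(display_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First three 10-base blocks of the gene sequence shown in this record.
fragment = join_blocks("ATGCGGTCCC CAGTGCTGTT CATCCCCGGT")
print(len(fragment))  # 30
print(f"{gc_fraction(fragment):.0%}")
```

Applied to the full gene sequence, the same two steps would be expected to give a length of 1599 bp and a GC fraction close to the 64% reported in the GC content field, assuming that field is computed over the gene sequence itself.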