Gene Information |
Locus tag | Htur_2957 |
Symbol | |
ID | 8743574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 3036980 |
End bp | 3039976 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646513541 |
Product | Oligosaccharyl transferase STT3 subunit |
Protein accession | YP_003404498 |
Protein GI | 284166219 |
COG category | [R] General function prediction only |
COG ID | [COG1287] Uncharacterized membrane protein, required for N-linked glycosylation |
TIGRFAM ID | |
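The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the reported 2997 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the terminal stop codon is included in the gene length
    # but does not contribute a residue to the protein.
    return codons - 1 if has_stop_codon else codons


length = gene_length(3036980, 3039976)
print(length)                  # 2997
print(protein_length(length))  # 998
```

Both results match the Gene Length and Protein Length rows of the record.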
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGTACGG ACACCGAACA CGTCGAGGAG GACTCCGAGA CGTCATCCTC CCTCCTCGAG ACGTGGTCTG ACTGGTATCA CATCCCCGTG CTCGGGGCTG TGATGCTGTT TATGTTCTGG GTGCGGACCC AGGCGTACGA CCGATTCGTT ACCGAAGACG GGAGTCCCGC CCTCGCGGGC GTCGACTCGT GGTATCACTG GCGAACAATC GAATGGACGG CGGAGAACTA CCCGAACACG ATGCCCTACG AGGTGTGGAC CGGGTTTCCG GAAGGCAACT ACGTCGGCCA GTTCGGTACC CTGTTCGACC AACTAATCGT CACGGTCGCG ATGATCGTCG GGCTCGGCAA TCCGTCGACT GAGACCGTCT ACACCGTCGC ACTACTGATG ATCCCGGCGA TGGCCGCGCT CGTGGCGATT CCGGTCTTCT ACGCGGGTCG GCGCCTCGGC GGCACGCTCG GGGGCATCGT CTCCGTGCTC GTGCTCGCAC TCGCGAAAGG GCAGTTCCTG ACGCGGTCGA CGGTCGGCCA GCTCGACCAC CACGTCGGCG AGGTGCTGTT CATGGCGATC GCCGTCGTCG CGATGATGGT CGCGCTCACC GTCGCCGAGC GCGACCAGCC GATCTACGAA CTGGTCGTCG ACAGGGACTG GGACGCGCTG CGAACGTCGA CCATCTACAG CGTCCTCGCC GGGGTCGCGC TCACGCTGTA CATCTGGGTC TGGCCCTCGG CGGTCCTCCT GGTCGGTATC TTCGGCGTCT TCTTCGCCGT GCAACTCTGC TTCGACTACC TCCGCGGGAT CTCGCCGGAC CACGTCGCCT TCGTCGGCGC GGTGAGCCTC GGCGTGACCG CGATCCTGAC GCTCCTCCTG ATGGAACAGC CCGGAAGCAC GAGTTCGACG AGCTTCGGAC TGCTCCAGCC CCTCGCCGCC TTCCTCGTGG CCGTCGGCTG CGTCTTCATG GCCTGGGTCG CCCGCCAGTG GAACGATCGC GATATCGAGC GGCGGTACTA CCCCGTTGCC ATCGCCGGCC TCATCGCCGC GGCGCTGCTG GCGATGTGGC TTCTCCTGCC CAGCCTGTTC GACTCGATCG TCGGGAACGC GACGCGGCGC ATGCTCCCGT TTGGCGGAAC GGCCACCGAC CTCACGATTT CCGAGGCCCA ACCCCCGGAG AACTTCCTCG ACAGCGTCTT TAGCGAGTTC GGAAGCGCGT TCTACACGAT GCTCGCCGGC CTCGCTTTCA TCGTCATTCG CCCGCTGTTC GGCCGCAAGA TCCGCGCCGA GCACACGCTG GTCGTCGTCT GGACGCTGTT CCTGATCAGC ATGGCGGCGA CGCAGGTTCG CTTCTCGTAC TACCTCGTGC TCGGCGTCGC GGTCGTCAAC GCCGCGTTCG TCGCGGAGTT CGTTCGCCTC TTCGATCTGG ACCTTCAGGC GAGCCTCGAG TCGATTCGCG GAATCGAGAC CTATCAGGTG ATCGTGCTTT TCCTCGTCGT GATCCTGCTG TTCGCGCCGC TACTCCCGCC GATCGCCGCC GACGATACGA CCTCTTGGGA ACAAGCCAAT AGCACCGGCC CGTCCTACGA CGCCACCATG GTCTGGGAGG GGTCGAACGA GTGGCTCGCC AACAACACGC CCGCGCCGGG CAACTGGGGC GAGCACGAAA ACGCCGATCA GCTCGAGTAC TTCGGATCGT ACGACCGCCC CGCCGACGGG GATTACGACT ATCCCGAAGG CTCCTACGGC GTGATGTCGT GGTGGGACTA CGGCCACCTG ATAACGACGC AGGGCGAGCG GATCCCCCAC 
GCGAACCCGT TCCAGCAGAA CGCGCCGTCC GCCTCGGCGT TCCTGACCGC CGAGTCCGAG GAACGCGGCG AACTGGTCCT CGATACCATC GCCGCCGGCG AGAACCCGGC CGACAAGTCC ACCGAGGAAC TCGAGTCGCT GTCCGAGGGC GCCACCCACG AGCAGATGCG GTACGTGATG GTCGACTACG AGATGGCGGC CGGGAAGTTC TCCGCGATCA CCGCGTGGTC CGGGCCGAAC TACGAACACT ACGTGACGCC GTCGGGCCAC GAGGACGGAG AGCCGGTGAA CATCAATGAC TACCAGAACG GGACCATCCC CTACTACGAC ACGATGCAGT CCCAGCTGTA CTTCGACGAC GCGGACGGGC TCGAGCACTA CCGGACCGTC CACGAAAACG CCGACGCTGG GACGGCGACG ATCGCCACCT ACGCGAATGT GTTCGTCCCC GGCGCCATTC AGGGACAGGT CGCCCAGGAG CTGCAGGAAC TGGGCTACCA GGAGGGTGAC GTGATCTACT ACCAGGACGG CGAGTTCGCG GCGCCCGGCG AGGGACGTCC GTCGGTAGCG ATACAGCAGA TGGGATTCCA GCGGATTCAG AGCATCCAAC AGAACCCGAC CCAGCAGCTA CTGGGCGTCC GGAGCGCCGC CGCGGTGAAG ACGTTCGAAC GCGTCGAGGG CGCGACGCTC ACTGGCACCG TCGACGACGC AGACGACGTG CTCGGAAACG AGACGACGGC CACCGTCGAG GTCGAACTCG AGACGAACGC CGGTCGGACG TTCAACTACA CGCAGGAGAC CGAAGTCGCC GAGGACGGCT CGTTCGAACT GACCGTTCCC TACGCGACCA ACGACGAACT CGGCGTCGAG GACGGCTACA CGGACAGCGC CGTCGAAGCG ACCGGCGAGT ACGACGTGAC CGTCGGCGAA CCGGGCGAAA CCGGCTACGC GGCGGAGACG GCCGTCCCCG AGACCGCAGT GGTTCAGGGC GAGACCATCA CCGTCGACGG GTTCGAGCAG ACCGAACTCG AGGACCCCGA CGAGTCGGAA GACGGCAACG AGACCGACGG CAACGAAACC GACGGCGGCT CGACGGACGG AAACGAAACC GACGGCGGGA ACGAAACCGA CGACGGCAAC CAGACCGACG GCGGCAACGA AACCGGTTCG GACGGCGCCG AAGCCGAGGA CAACTAA
|
Protein sequence | MSTDTEHVEE DSETSSSLLE TWSDWYHIPV LGAVMLFMFW VRTQAYDRFV TEDGSPALAG VDSWYHWRTI EWTAENYPNT MPYEVWTGFP EGNYVGQFGT LFDQLIVTVA MIVGLGNPST ETVYTVALLM IPAMAALVAI PVFYAGRRLG GTLGGIVSVL VLALAKGQFL TRSTVGQLDH HVGEVLFMAI AVVAMMVALT VAERDQPIYE LVVDRDWDAL RTSTIYSVLA GVALTLYIWV WPSAVLLVGI FGVFFAVQLC FDYLRGISPD HVAFVGAVSL GVTAILTLLL MEQPGSTSST SFGLLQPLAA FLVAVGCVFM AWVARQWNDR DIERRYYPVA IAGLIAAALL AMWLLLPSLF DSIVGNATRR MLPFGGTATD LTISEAQPPE NFLDSVFSEF GSAFYTMLAG LAFIVIRPLF GRKIRAEHTL VVVWTLFLIS MAATQVRFSY YLVLGVAVVN AAFVAEFVRL FDLDLQASLE SIRGIETYQV IVLFLVVILL FAPLLPPIAA DDTTSWEQAN STGPSYDATM VWEGSNEWLA NNTPAPGNWG EHENADQLEY FGSYDRPADG DYDYPEGSYG VMSWWDYGHL ITTQGERIPH ANPFQQNAPS ASAFLTAESE ERGELVLDTI AAGENPADKS TEELESLSEG ATHEQMRYVM VDYEMAAGKF SAITAWSGPN YEHYVTPSGH EDGEPVNIND YQNGTIPYYD TMQSQLYFDD ADGLEHYRTV HENADAGTAT IATYANVFVP GAIQGQVAQE LQELGYQEGD VIYYQDGEFA APGEGRPSVA IQQMGFQRIQ SIQQNPTQQL LGVRSAAAVK TFERVEGATL TGTVDDADDV LGNETTATVE VELETNAGRT FNYTQETEVA EDGSFELTVP YATNDELGVE DGYTDSAVEA TGEYDVTVGE PGETGYAAET AVPETAVVQG ETITVDGFEQ TELEDPDESE DGNETDGNET DGGSTDGNET DGGNETDDGN QTDGGNETGS DGAEAEDN
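The sequences in this record are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The sketch below shows one possible way to strip that layout and compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the short string in the usage example is just the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage example on the first three ten-base blocks of the gene sequence.
fragment = clean_sequence("ATGAGTACGG ACACCGAACA CGTCGAGGAG")
print(len(fragment))
print(round(gc_fraction(fragment), 3))
```

Applied to the complete gene sequence, the same two calls would give a whole-gene value to compare against the reported 65%.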