Gene Htur_2957 details


Gene Information

Locus tag: Htur_2957
Symbol:
ID: 8743574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 3036980
End bp: 3039976
Gene Length: 2997 bp
Protein Length: 998 aa
Translation table: 11
GC content: 65%
IMG OID: 646513541
Product: Oligosaccharyl transferase STT3 subunit
Protein accession: YP_003404498
Protein GI: 284166219
COG category: [R] General function prediction only
COG ID: [COG1287] Uncharacterized membrane protein, required for N-linked glycosylation
TIGRFAM ID:
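
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3036980
end_bp = 3039976
gene_length_bp = 2997
protein_length_aa = 998

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop match Protein Length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: 2997 bp is 999 codons, which is 998 residues plus one stop codon.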


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTACGG ACACCGAACA CGTCGAGGAG GACTCCGAGA CGTCATCCTC CCTCCTCGAG 
ACGTGGTCTG ACTGGTATCA CATCCCCGTG CTCGGGGCTG TGATGCTGTT TATGTTCTGG
GTGCGGACCC AGGCGTACGA CCGATTCGTT ACCGAAGACG GGAGTCCCGC CCTCGCGGGC
GTCGACTCGT GGTATCACTG GCGAACAATC GAATGGACGG CGGAGAACTA CCCGAACACG
ATGCCCTACG AGGTGTGGAC CGGGTTTCCG GAAGGCAACT ACGTCGGCCA GTTCGGTACC
CTGTTCGACC AACTAATCGT CACGGTCGCG ATGATCGTCG GGCTCGGCAA TCCGTCGACT
GAGACCGTCT ACACCGTCGC ACTACTGATG ATCCCGGCGA TGGCCGCGCT CGTGGCGATT
CCGGTCTTCT ACGCGGGTCG GCGCCTCGGC GGCACGCTCG GGGGCATCGT CTCCGTGCTC
GTGCTCGCAC TCGCGAAAGG GCAGTTCCTG ACGCGGTCGA CGGTCGGCCA GCTCGACCAC
CACGTCGGCG AGGTGCTGTT CATGGCGATC GCCGTCGTCG CGATGATGGT CGCGCTCACC
GTCGCCGAGC GCGACCAGCC GATCTACGAA CTGGTCGTCG ACAGGGACTG GGACGCGCTG
CGAACGTCGA CCATCTACAG CGTCCTCGCC GGGGTCGCGC TCACGCTGTA CATCTGGGTC
TGGCCCTCGG CGGTCCTCCT GGTCGGTATC TTCGGCGTCT TCTTCGCCGT GCAACTCTGC
TTCGACTACC TCCGCGGGAT CTCGCCGGAC CACGTCGCCT TCGTCGGCGC GGTGAGCCTC
GGCGTGACCG CGATCCTGAC GCTCCTCCTG ATGGAACAGC CCGGAAGCAC GAGTTCGACG
AGCTTCGGAC TGCTCCAGCC CCTCGCCGCC TTCCTCGTGG CCGTCGGCTG CGTCTTCATG
GCCTGGGTCG CCCGCCAGTG GAACGATCGC GATATCGAGC GGCGGTACTA CCCCGTTGCC
ATCGCCGGCC TCATCGCCGC GGCGCTGCTG GCGATGTGGC TTCTCCTGCC CAGCCTGTTC
GACTCGATCG TCGGGAACGC GACGCGGCGC ATGCTCCCGT TTGGCGGAAC GGCCACCGAC
CTCACGATTT CCGAGGCCCA ACCCCCGGAG AACTTCCTCG ACAGCGTCTT TAGCGAGTTC
GGAAGCGCGT TCTACACGAT GCTCGCCGGC CTCGCTTTCA TCGTCATTCG CCCGCTGTTC
GGCCGCAAGA TCCGCGCCGA GCACACGCTG GTCGTCGTCT GGACGCTGTT CCTGATCAGC
ATGGCGGCGA CGCAGGTTCG CTTCTCGTAC TACCTCGTGC TCGGCGTCGC GGTCGTCAAC
GCCGCGTTCG TCGCGGAGTT CGTTCGCCTC TTCGATCTGG ACCTTCAGGC GAGCCTCGAG
TCGATTCGCG GAATCGAGAC CTATCAGGTG ATCGTGCTTT TCCTCGTCGT GATCCTGCTG
TTCGCGCCGC TACTCCCGCC GATCGCCGCC GACGATACGA CCTCTTGGGA ACAAGCCAAT
AGCACCGGCC CGTCCTACGA CGCCACCATG GTCTGGGAGG GGTCGAACGA GTGGCTCGCC
AACAACACGC CCGCGCCGGG CAACTGGGGC GAGCACGAAA ACGCCGATCA GCTCGAGTAC
TTCGGATCGT ACGACCGCCC CGCCGACGGG GATTACGACT ATCCCGAAGG CTCCTACGGC
GTGATGTCGT GGTGGGACTA CGGCCACCTG ATAACGACGC AGGGCGAGCG GATCCCCCAC
GCGAACCCGT TCCAGCAGAA CGCGCCGTCC GCCTCGGCGT TCCTGACCGC CGAGTCCGAG
GAACGCGGCG AACTGGTCCT CGATACCATC GCCGCCGGCG AGAACCCGGC CGACAAGTCC
ACCGAGGAAC TCGAGTCGCT GTCCGAGGGC GCCACCCACG AGCAGATGCG GTACGTGATG
GTCGACTACG AGATGGCGGC CGGGAAGTTC TCCGCGATCA CCGCGTGGTC CGGGCCGAAC
TACGAACACT ACGTGACGCC GTCGGGCCAC GAGGACGGAG AGCCGGTGAA CATCAATGAC
TACCAGAACG GGACCATCCC CTACTACGAC ACGATGCAGT CCCAGCTGTA CTTCGACGAC
GCGGACGGGC TCGAGCACTA CCGGACCGTC CACGAAAACG CCGACGCTGG GACGGCGACG
ATCGCCACCT ACGCGAATGT GTTCGTCCCC GGCGCCATTC AGGGACAGGT CGCCCAGGAG
CTGCAGGAAC TGGGCTACCA GGAGGGTGAC GTGATCTACT ACCAGGACGG CGAGTTCGCG
GCGCCCGGCG AGGGACGTCC GTCGGTAGCG ATACAGCAGA TGGGATTCCA GCGGATTCAG
AGCATCCAAC AGAACCCGAC CCAGCAGCTA CTGGGCGTCC GGAGCGCCGC CGCGGTGAAG
ACGTTCGAAC GCGTCGAGGG CGCGACGCTC ACTGGCACCG TCGACGACGC AGACGACGTG
CTCGGAAACG AGACGACGGC CACCGTCGAG GTCGAACTCG AGACGAACGC CGGTCGGACG
TTCAACTACA CGCAGGAGAC CGAAGTCGCC GAGGACGGCT CGTTCGAACT GACCGTTCCC
TACGCGACCA ACGACGAACT CGGCGTCGAG GACGGCTACA CGGACAGCGC CGTCGAAGCG
ACCGGCGAGT ACGACGTGAC CGTCGGCGAA CCGGGCGAAA CCGGCTACGC GGCGGAGACG
GCCGTCCCCG AGACCGCAGT GGTTCAGGGC GAGACCATCA CCGTCGACGG GTTCGAGCAG
ACCGAACTCG AGGACCCCGA CGAGTCGGAA GACGGCAACG AGACCGACGG CAACGAAACC
GACGGCGGCT CGACGGACGG AAACGAAACC GACGGCGGGA ACGAAACCGA CGACGGCAAC
CAGACCGACG GCGGCAACGA AACCGGTTCG GACGGCGCCG AAGCCGAGGA CAACTAA
 
Protein sequence
MSTDTEHVEE DSETSSSLLE TWSDWYHIPV LGAVMLFMFW VRTQAYDRFV TEDGSPALAG 
VDSWYHWRTI EWTAENYPNT MPYEVWTGFP EGNYVGQFGT LFDQLIVTVA MIVGLGNPST
ETVYTVALLM IPAMAALVAI PVFYAGRRLG GTLGGIVSVL VLALAKGQFL TRSTVGQLDH
HVGEVLFMAI AVVAMMVALT VAERDQPIYE LVVDRDWDAL RTSTIYSVLA GVALTLYIWV
WPSAVLLVGI FGVFFAVQLC FDYLRGISPD HVAFVGAVSL GVTAILTLLL MEQPGSTSST
SFGLLQPLAA FLVAVGCVFM AWVARQWNDR DIERRYYPVA IAGLIAAALL AMWLLLPSLF
DSIVGNATRR MLPFGGTATD LTISEAQPPE NFLDSVFSEF GSAFYTMLAG LAFIVIRPLF
GRKIRAEHTL VVVWTLFLIS MAATQVRFSY YLVLGVAVVN AAFVAEFVRL FDLDLQASLE
SIRGIETYQV IVLFLVVILL FAPLLPPIAA DDTTSWEQAN STGPSYDATM VWEGSNEWLA
NNTPAPGNWG EHENADQLEY FGSYDRPADG DYDYPEGSYG VMSWWDYGHL ITTQGERIPH
ANPFQQNAPS ASAFLTAESE ERGELVLDTI AAGENPADKS TEELESLSEG ATHEQMRYVM
VDYEMAAGKF SAITAWSGPN YEHYVTPSGH EDGEPVNIND YQNGTIPYYD TMQSQLYFDD
ADGLEHYRTV HENADAGTAT IATYANVFVP GAIQGQVAQE LQELGYQEGD VIYYQDGEFA
APGEGRPSVA IQQMGFQRIQ SIQQNPTQQL LGVRSAAAVK TFERVEGATL TGTVDDADDV
LGNETTATVE VELETNAGRT FNYTQETEVA EDGSFELTVP YATNDELGVE DGYTDSAVEA
TGEYDVTVGE PGETGYAAET AVPETAVVQG ETITVDGFEQ TELEDPDESE DGNETDGNET
DGGSTDGNET DGGNETDDGN QTDGGNETGS DGAEAEDN
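
Both sequences are printed in space-separated groups of ten characters, so the spacing has to be stripped before they are used. The sketch below shows one way to do that and to confirm that the gene sequence translates to the listed protein. It is a minimal illustration, not the database's own method: the helper names (clean, translate, gc_fraction) are hypothetical, and the codon table is the standard genetic code, whose codon-to-amino-acid assignments are the same in translation table 11 (table 11 differs only in which codons may act as start codons).

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the grouping spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence and the first two groups of the protein sequence.
first_gene_line = "ATGAGTACGG ACACCGAACA CGTCGAGGAG GACTCCGAGA CGTCATCCTC CCTCCTCGAG"
first_protein_groups = "MSTDTEHVEE DSETSSSLLE"

print(translate(first_gene_line) == clean(first_protein_groups))
```

Passing the full gene sequence to translate and gc_fraction would allow the same comparison against the complete protein sequence and the reported GC content.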