Gene Htur_3524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3524 
Symbol 
ID8744144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp3627836 
End bp3629446 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content67% 
IMG OID646514105 
Productglycosyl transferase family 39 
Protein accessionYP_003405059 
Protein GI284166780 
COG category[S] Function unknown 
COG ID[COG5305] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCCGT CCCTGCCGAT CGAGACGGGC GGTGAGGGAA ATCGGGATCG ATCACTGGCG 
GGCGTCCCGG TCGAACTGTA TCCGATCGTC CTCATCGCGA CTGCGCTGCG GCTGTTCCGG
CTCGAGTCCG AGAGTTACTG GGTCGACGAA GTCGTCTCGG TGACCACCGT CACGTCGAAC
ACGCCGATCG AACTCCTGAT CAGCGTGCCG GGGAACGATC CCCACCCGCC GTTCTACTAC
CTCCTCCTGT CCGGCTGGAC GGCCCTCTTC GGCACGAGCG AACTGGCCAC GCGACTGCTG
TCGGCCCTCG CCGGCATCGC CACCGTGGTC GTCCTCTACG GGATCGGGCG GCGACTCTTC
GACAGGGAGG TCGGAGCGAT CGCCGCGGTC CTCGTCGCCG TCTCGCCGTT CCACGTCTGG
TACGCACAGG AGGTCCGAAT GTACAACCTG CTCGCGTTGC TGACCGCGCT CTCCGTCTAC
TACTTCGTGC GGATACAGAC GGATCGAGCG ACCGACGAGT CCGGGTTTCG AACCGAGATC
GGGTACGTCG TCTCCACGGT CCTGCTCGGC TACACGCACG TCTTCGGGCT GTTCGTGATA
CTCGCCCAGA ACGCCTACGT CTTCTCACGA CCGCTCGTCC GGATCGTCCC CCGGTCGCGA
CTGACGCTCC GCCGCTGGTT CGAACTCGAG GCGCTCACCG CCCTGCTACT CGCGCCGTGG
CTGGTGAAAC TGGTTCGCCG AATGCTGGCG GCGCGTGCTG GCGAGACGAC CAACGTCTCG
TGGATCCCCC TACCGACGGC CGAGACCGTC AGGGAGACGT TCGCGGCGTA CCTCGGCGCC
TACCTGTTCG AGGAGTCGTT CCCCCTTCTC GTCTCGCTCG TCGTCGTCGG CTGTCTGGTG
CTCGCGCTCT CGAGCGGCCG ATACGTGGCG ACGGAGCCCG GCACCAGTGC GGAAGCCGAC
CGGAGCGAAG CGGCGGGAAC CGACCGCGGG AGGGAGATAG CGGCCGACCG CGAGAGCCTA
CCGGTAAATG CCGTCTACCT GGTCGTCCTC TGGTTCGTCG TGCCGATCCT CGTTCCGATC
GCCCTCTCGC ACGTCGTGAC GCCGATCTTC GTGGACCGGT ACTCGATCGG CGCGTCGCTG
GCGTTTTTCC TCCTGATCGC AGTCGGGATC CGGACGCTCT CCCGGCCGTC GCTTCGGTAC
GTCGTCGTCG GGGTGCTCCT CGTGGGTCTC GTGGCGCCGC TCCCGACGTA CTACCAGGAC
GACCAGAAGG AGCAGTGGCG GGGGGCCGCC GCCGACGTCG AGTCGGCCGT CGACGGCGAC
GACGTCGTCC TCGTGAGCAG ACCGTTCACG GAGCGGACGT TCGGCTTCTA CTTCGACCGG
TCGGACGTGC CAACGGTCCG GATCCCGCCC GACGCGTCGG GCGACGAGAT ACGGTCGGCC
GTCGACGGCC ACGACGACGT CTGGATCGTG CTCTCGTACA CCAGTTCGTC GACCAACCAG
CGGATCGTCG ACGCCGTGGC GAACCGCGAC GACTACCGCG GCCCCGTCGA GGTGAACCGG
TACAACGGCA TCGCTGTGGT CCGGCTCGAG CGGACGTCGA ACGGCGGCTA G
 
Protein sequence
MSPSLPIETG GEGNRDRSLA GVPVELYPIV LIATALRLFR LESESYWVDE VVSVTTVTSN 
TPIELLISVP GNDPHPPFYY LLLSGWTALF GTSELATRLL SALAGIATVV VLYGIGRRLF
DREVGAIAAV LVAVSPFHVW YAQEVRMYNL LALLTALSVY YFVRIQTDRA TDESGFRTEI
GYVVSTVLLG YTHVFGLFVI LAQNAYVFSR PLVRIVPRSR LTLRRWFELE ALTALLLAPW
LVKLVRRMLA ARAGETTNVS WIPLPTAETV RETFAAYLGA YLFEESFPLL VSLVVVGCLV
LALSSGRYVA TEPGTSAEAD RSEAAGTDRG REIAADRESL PVNAVYLVVL WFVVPILVPI
ALSHVVTPIF VDRYSIGASL AFFLLIAVGI RTLSRPSLRY VVVGVLLVGL VAPLPTYYQD
DQKEQWRGAA ADVESAVDGD DVVLVSRPFT ERTFGFYFDR SDVPTVRIPP DASGDEIRSA
VDGHDDVWIV LSYTSSSTNQ RIVDAVANRD DYRGPVEVNR YNGIAVVRLE RTSNGG