Gene Htur_5274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_5274 
Symbol 
ID8745922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013749 
Strand
Start bp5346 
End bp6803 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content56% 
IMG OID646515728 
Productglycosyl hydrolase BNR repeat-containing protein 
Protein accessionYP_003406675 
Protein GI284176400 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones110 
Plasmid unclonability p-value0.0502188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGAAG GGGAACGCAA AGTCTCGAGA CGAGCAGTGC TCGCAGCCGC CGGGGGAGTC 
ACCGCTGTTG GAACAGCGCT GAGTCAATCC AGCCAGAAAA AACTCGGAAA TCTGTTCGGG
GGGACCATGG GGCTTAACGA AGCCTACAAC GTGAGCGGCG ATCTCTGGAT CGGCCCGGAC
TCGGCGAAGG GCGACGTCGC TGCCGAGAGT GGACGGGTCT ACATGGCGGT AGATACGCAA
GTCGAATATT ACGGCACCGG AGACAGCTGG GACGGAATGG GGCTTGGGAG CGCCTCGAAC
CCAGTCTCCG AGATCCACAG CGAATCGGTA AGTACAGAAG AAGCCGTTAC CGCTCGCCAT
AAAAGACTGG GGAGCGTTAC TACGACACCG AGTTATTGGG GTCCAGACCT TCAACTGGAT
AACGTCTATC AGACGTTTGA CTCGCAGCAC GTCAAAGGCC GTCTGATTGG CAAATACTAC
TGGTCTGACC CTCCGTATGG CACCGAAAAC GCGGGTGTGG ACTCATACGA TCCCGACGAC
GGGGCAAACA ACAACGGGGT TGCTATGGAT GGGGACGCAA CACCGTGTCT GGTAGCCGAC
GATTCGATTT TCGTTTTCAA AGACAAGGAC GAAACGAACG GTCTTCCGGT TCGGATCTAC
CGCGCTCCAG ACTATCAAAC CACGTTCGAC AACAGCGGCA CGCTCCAGTA TGAGCGTGTG
CTTGATTTTG ATGACGGTGG GGCGACGATG ATCGGCAAAG GTGGGCGATT TAACTACCAA
ATTTCAGCCC ATCCTGATTC GGGGACTATC GTTGTTGGCG AATACAATCA GACGGAAGGG
GCTGACCCGA AACTCTACCG CTCGACGGAC AACGGCCAAA CGTGGAGTGT CGTTTATCAA
GAGACAGGGG TCCCGGATCA CGTCCACTCG GTCGCTCCAG ATCCATACGA GCCGGATCAC
TGGATTGTCT GCCTCGGAGA CAACGGCGAA GATCGGTATC TCGAGTCGTG GGACGATGGT
GCGACTTGGT CGCGAGAACC GCTTGGATTC GACCCCAAAA CGCACGCTGT TGGGATTGAC
TACGGTCCTG ATTACATCTA CTTCGCGCAA GACCAAGGCT CAGGGTCCTA TCATGGTCCG
TGGGTCGTCC GGCGCGAAGA TCGCGAGATG TTCTCCCTCT GTTCAACGAA CCCACGGTTC
AAGTACCGGG GCGTCGTTGA CGGCATGATC GAGCTCGGTC TCATCCACGA CAGGGCTCAC
GGAATCACGT ACATTCGTGG GCGAGACGGG ACAAATGGCA CGAATTACGT GTACTATGTC
GACGGAATTG GAGGAGAACC ACAGATGATT GACGATGCAT GGGGCTCACC ACAAATGCAC
CCTGTTGATG GCTACATATC GTCCGCTCAG GGGAAGATGT ACGCCCGACT GTCGCCGGTT
CCTGCCAATA AATTGTAA
 
Protein sequence
MSEGERKVSR RAVLAAAGGV TAVGTALSQS SQKKLGNLFG GTMGLNEAYN VSGDLWIGPD 
SAKGDVAAES GRVYMAVDTQ VEYYGTGDSW DGMGLGSASN PVSEIHSESV STEEAVTARH
KRLGSVTTTP SYWGPDLQLD NVYQTFDSQH VKGRLIGKYY WSDPPYGTEN AGVDSYDPDD
GANNNGVAMD GDATPCLVAD DSIFVFKDKD ETNGLPVRIY RAPDYQTTFD NSGTLQYERV
LDFDDGGATM IGKGGRFNYQ ISAHPDSGTI VVGEYNQTEG ADPKLYRSTD NGQTWSVVYQ
ETGVPDHVHS VAPDPYEPDH WIVCLGDNGE DRYLESWDDG ATWSREPLGF DPKTHAVGID
YGPDYIYFAQ DQGSGSYHGP WVVRREDREM FSLCSTNPRF KYRGVVDGMI ELGLIHDRAH
GITYIRGRDG TNGTNYVYYV DGIGGEPQMI DDAWGSPQMH PVDGYISSAQ GKMYARLSPV
PANKL