Gene Htur_0116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_0116 
Symbol 
ID8740679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp126077 
End bp128257 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content69% 
IMG OID646510679 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_003401690 
Protein GI284163411 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCGG CCACGACTGC CTGCCGCCTC GCCGAGCTAT ATGGGCCGGC GGGTACGACC 
TCGAGTATGA ACAGGATCGA GGCGGTCGAC TACCACGAGA TCACCCACGT CGAGGAGCCG
CGGCTCTCGC CGAGCGACGA GCGGGTGGCG TTCGTCCGTC GGACGCCCGC CGCCGACGAA
TCGTACGCGG CGTCGATCTA CACCGTCCCC GTCGGCGGCG ACGAGGCCGC GCAGTTCACC
GCGAGTGACG GCGTCGATTC CCAGCCCCGC TGGAGCCCCG ACGGCGATCG ACTCGCCTTC
GCCAGCACCC GTGGCGAGGG CGACCGCGAG CAGGTCTGGC TCGTCCCGAC CGACGGCGGC
GAAGCGCGCC GGCTCACGTC GGTCGTCGGC GGGATCGACG ACCTCGAGTG GAGTCCGAAC
GGCTCCAGAC TACTCTTCTC CCAGCGCGTC GCGCCCGAGG ATCGCGAGGC CGGTCGGGAT
CGCACGGTCG ACCCCGACTA CGAGCGCGAG ACCCCCGATC CGCGGGTGAT CGACCGCACG
ATCTACCGGG CCGGAACCGA ATACATCGAC GGCCGGCGGC GCCACGTCTA CGTCCTCGAG
GTCGAGGCCG CGCTCTCGAG GGACCCGGAG GACAATCCGG ACGGAACGGC GATCACCCGC
CTCACCGACG AGGACGGGCC GGCCGACGTG CCCGTCGACT ACGTCTCTCC GACGTGGGGC
GACGACGAGA CGATCTACTA CGCGGCCAAG GCCGCCGCGG CCGGCGAGGA TCCCGACGAC
ACGCTGGCCT ACGACCTGTA CGAACACGGG ACGGACTCCG GCGAGATTGA GGCGTTCACG
CAAACCACGG GCTGGCTCGA GTCCGGATCG ATCGACGCCA CCGCGGACGG TCGCGTCGCC
TTCGAGTTCA CGCCCGAGGA CCGAACTTCG ATGCGCCAGA CCGAGATCCG CGTACACGAC
CGCGAGACTG GCGAGGAGCG GACGCCGACG GAACCGCTCG ATCGGACCGT GGGCCACCGC
TGTGGCTTCG AGTGGGCGCC CGACGGCGAG ACGCTGTACG TCACGACGCC CGACGAGGGC
TCGCGCGTCT GCTGGTCGGT GCCCGGCGAC GCGAGCGAAG ACCCGACGCG AGTCTACGGC
GACGGCGTCA CGATCGCGGA CTTTTCGGTC GGCGAGAACG CCGTCGCCTA CGTCTACAGC
GAGTGGGACC ACCCCGGCGA TGTCTTCGTG ACGACCCGTG GCGGCAACGA GGTCCACCGG
CTGACTCGAG TGAACGACGA TTACCTCGCG GATCGCGCGG TTCGTCAACC CGAAGAAGTG
TGGTTCAAGA CCGACGACGG GACGGAGAGT CAGGGCTGGC TGCTGACGCC CCCCGAGTTC
GACGCCGACG CGTCGCCGGG TGAGCGGTAC CCCCTCGTCG TCGAGGTTCA CGGCGGTCCC
CACGCCCACT GGACGACCGC GGGGACGATG TGGCACGAGT TCCAGACGCT CGCGGCGCGA
GGGTACGTCG TCTTCTGGTG CAACCCGCGA GGTTCGACGG GGTACGGCGA GGACCGCGCG
ATAGCCATCG AGGGCGACTG GGGCGAGATC ACGCTGACGG ACGTGCTCGC CGGCGTCGAG
ACGGTCTGCG AGCGTGACTT CGTGGACGAC GGCGAGGTGT TCGTCACCGG CGGCAGCTTC
GGCGGGTTCA TGACCGCGTG GGCGGTCGCC CACAGCGACC GCTTCGAGGC GGCGGTCTCC
CAGCGAGGCG TCTACGATCT CACCGGATTC TACGGCTCGA GCGACGCGTT CACACTCGTC
GAAGACGATT TCGGGACGAC GCCCTGGGAC GACCCCGACT TCCTGTGGAA CCAGTCGCCC
GTCGCCCACG TCGCCGACGT CGACGCGCCC ACGCTCGTGT TGCACTCCGA TCAGGACTAC
CGGACGCCCG CCAACACGGC CGAACTGTTC GTCCGCGGAC TGCAGAAACA CGGCGTCGAG
ACGCGGCTGG TCAGGTATCC TCGCGAGGGC CACGAACTCT CCCGGTCGGG CGAACCCGCC
CACGTCGTCG ATCGACTCGA GCGCATCGCC CGCTGGTTCG ACGGCTACTC AGCGTATCAC
GAGTCTTCAC CGGCGCTCGA GCGCGACCGT GACGCCGGGC TCTCGAGCGG GGACGAAGAC
GGCGACCGGA ACGAGGAGTG A
 
Protein sequence
MNAATTACRL AELYGPAGTT SSMNRIEAVD YHEITHVEEP RLSPSDERVA FVRRTPAADE 
SYAASIYTVP VGGDEAAQFT ASDGVDSQPR WSPDGDRLAF ASTRGEGDRE QVWLVPTDGG
EARRLTSVVG GIDDLEWSPN GSRLLFSQRV APEDREAGRD RTVDPDYERE TPDPRVIDRT
IYRAGTEYID GRRRHVYVLE VEAALSRDPE DNPDGTAITR LTDEDGPADV PVDYVSPTWG
DDETIYYAAK AAAAGEDPDD TLAYDLYEHG TDSGEIEAFT QTTGWLESGS IDATADGRVA
FEFTPEDRTS MRQTEIRVHD RETGEERTPT EPLDRTVGHR CGFEWAPDGE TLYVTTPDEG
SRVCWSVPGD ASEDPTRVYG DGVTIADFSV GENAVAYVYS EWDHPGDVFV TTRGGNEVHR
LTRVNDDYLA DRAVRQPEEV WFKTDDGTES QGWLLTPPEF DADASPGERY PLVVEVHGGP
HAHWTTAGTM WHEFQTLAAR GYVVFWCNPR GSTGYGEDRA IAIEGDWGEI TLTDVLAGVE
TVCERDFVDD GEVFVTGGSF GGFMTAWAVA HSDRFEAAVS QRGVYDLTGF YGSSDAFTLV
EDDFGTTPWD DPDFLWNQSP VAHVADVDAP TLVLHSDQDY RTPANTAELF VRGLQKHGVE
TRLVRYPREG HELSRSGEPA HVVDRLERIA RWFDGYSAYH ESSPALERDR DAGLSSGDED
GDRNEE