Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_0116 |
Symbol | |
ID | 8740679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 126077 |
End bp | 128257 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646510679 |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_003401690 |
Protein GI | 284163411 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCGG CCACGACTGC CTGCCGCCTC GCCGAGCTAT ATGGGCCGGC GGGTACGACC TCGAGTATGA ACAGGATCGA GGCGGTCGAC TACCACGAGA TCACCCACGT CGAGGAGCCG CGGCTCTCGC CGAGCGACGA GCGGGTGGCG TTCGTCCGTC GGACGCCCGC CGCCGACGAA TCGTACGCGG CGTCGATCTA CACCGTCCCC GTCGGCGGCG ACGAGGCCGC GCAGTTCACC GCGAGTGACG GCGTCGATTC CCAGCCCCGC TGGAGCCCCG ACGGCGATCG ACTCGCCTTC GCCAGCACCC GTGGCGAGGG CGACCGCGAG CAGGTCTGGC TCGTCCCGAC CGACGGCGGC GAAGCGCGCC GGCTCACGTC GGTCGTCGGC GGGATCGACG ACCTCGAGTG GAGTCCGAAC GGCTCCAGAC TACTCTTCTC CCAGCGCGTC GCGCCCGAGG ATCGCGAGGC CGGTCGGGAT CGCACGGTCG ACCCCGACTA CGAGCGCGAG ACCCCCGATC CGCGGGTGAT CGACCGCACG ATCTACCGGG CCGGAACCGA ATACATCGAC GGCCGGCGGC GCCACGTCTA CGTCCTCGAG GTCGAGGCCG CGCTCTCGAG GGACCCGGAG GACAATCCGG ACGGAACGGC GATCACCCGC CTCACCGACG AGGACGGGCC GGCCGACGTG CCCGTCGACT ACGTCTCTCC GACGTGGGGC GACGACGAGA CGATCTACTA CGCGGCCAAG GCCGCCGCGG CCGGCGAGGA TCCCGACGAC ACGCTGGCCT ACGACCTGTA CGAACACGGG ACGGACTCCG GCGAGATTGA GGCGTTCACG CAAACCACGG GCTGGCTCGA GTCCGGATCG ATCGACGCCA CCGCGGACGG TCGCGTCGCC TTCGAGTTCA CGCCCGAGGA CCGAACTTCG ATGCGCCAGA CCGAGATCCG CGTACACGAC CGCGAGACTG GCGAGGAGCG GACGCCGACG GAACCGCTCG ATCGGACCGT GGGCCACCGC TGTGGCTTCG AGTGGGCGCC CGACGGCGAG ACGCTGTACG TCACGACGCC CGACGAGGGC TCGCGCGTCT GCTGGTCGGT GCCCGGCGAC GCGAGCGAAG ACCCGACGCG AGTCTACGGC GACGGCGTCA CGATCGCGGA CTTTTCGGTC GGCGAGAACG CCGTCGCCTA CGTCTACAGC GAGTGGGACC ACCCCGGCGA TGTCTTCGTG ACGACCCGTG GCGGCAACGA GGTCCACCGG CTGACTCGAG TGAACGACGA TTACCTCGCG GATCGCGCGG TTCGTCAACC CGAAGAAGTG TGGTTCAAGA CCGACGACGG GACGGAGAGT CAGGGCTGGC TGCTGACGCC CCCCGAGTTC GACGCCGACG CGTCGCCGGG TGAGCGGTAC CCCCTCGTCG TCGAGGTTCA CGGCGGTCCC CACGCCCACT GGACGACCGC GGGGACGATG TGGCACGAGT TCCAGACGCT CGCGGCGCGA GGGTACGTCG TCTTCTGGTG CAACCCGCGA GGTTCGACGG GGTACGGCGA GGACCGCGCG ATAGCCATCG AGGGCGACTG GGGCGAGATC ACGCTGACGG ACGTGCTCGC CGGCGTCGAG ACGGTCTGCG AGCGTGACTT CGTGGACGAC GGCGAGGTGT TCGTCACCGG CGGCAGCTTC GGCGGGTTCA TGACCGCGTG GGCGGTCGCC CACAGCGACC GCTTCGAGGC GGCGGTCTCC CAGCGAGGCG TCTACGATCT CACCGGATTC TACGGCTCGA GCGACGCGTT CACACTCGTC GAAGACGATT TCGGGACGAC GCCCTGGGAC GACCCCGACT TCCTGTGGAA CCAGTCGCCC GTCGCCCACG TCGCCGACGT CGACGCGCCC ACGCTCGTGT TGCACTCCGA TCAGGACTAC CGGACGCCCG CCAACACGGC CGAACTGTTC GTCCGCGGAC TGCAGAAACA CGGCGTCGAG ACGCGGCTGG TCAGGTATCC TCGCGAGGGC CACGAACTCT CCCGGTCGGG CGAACCCGCC CACGTCGTCG ATCGACTCGA GCGCATCGCC CGCTGGTTCG ACGGCTACTC AGCGTATCAC GAGTCTTCAC CGGCGCTCGA GCGCGACCGT GACGCCGGGC TCTCGAGCGG GGACGAAGAC GGCGACCGGA ACGAGGAGTG A
|
Protein sequence | MNAATTACRL AELYGPAGTT SSMNRIEAVD YHEITHVEEP RLSPSDERVA FVRRTPAADE SYAASIYTVP VGGDEAAQFT ASDGVDSQPR WSPDGDRLAF ASTRGEGDRE QVWLVPTDGG EARRLTSVVG GIDDLEWSPN GSRLLFSQRV APEDREAGRD RTVDPDYERE TPDPRVIDRT IYRAGTEYID GRRRHVYVLE VEAALSRDPE DNPDGTAITR LTDEDGPADV PVDYVSPTWG DDETIYYAAK AAAAGEDPDD TLAYDLYEHG TDSGEIEAFT QTTGWLESGS IDATADGRVA FEFTPEDRTS MRQTEIRVHD RETGEERTPT EPLDRTVGHR CGFEWAPDGE TLYVTTPDEG SRVCWSVPGD ASEDPTRVYG DGVTIADFSV GENAVAYVYS EWDHPGDVFV TTRGGNEVHR LTRVNDDYLA DRAVRQPEEV WFKTDDGTES QGWLLTPPEF DADASPGERY PLVVEVHGGP HAHWTTAGTM WHEFQTLAAR GYVVFWCNPR GSTGYGEDRA IAIEGDWGEI TLTDVLAGVE TVCERDFVDD GEVFVTGGSF GGFMTAWAVA HSDRFEAAVS QRGVYDLTGF YGSSDAFTLV EDDFGTTPWD DPDFLWNQSP VAHVADVDAP TLVLHSDQDY RTPANTAELF VRGLQKHGVE TRLVRYPREG HELSRSGEPA HVVDRLERIA RWFDGYSAYH ESSPALERDR DAGLSSGDED GDRNEE
|
| |