Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3188 |
Symbol | |
ID | 8743808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 3276039 |
End bp | 3277877 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646513772 |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_003404726 |
Protein GI | 284166447 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAGCT ACGAGATCGA ACGCTACCTC AATATTCGAA GCGCCTACGG AACCTCCTTC GGTCCCGACG GCGAGCGCCT CTCCTTCCTG ATGAACACGA CCGGGACCCC GCAGGTCTGG ACGCTCGAGG AGCCGCGCGC GTGGCCCGAA CAGCGGACCT TCTACGACGA GCGGGTGACC TTCGCCTCGT GGTCGCCGGA GCGGCCGGAA CTGATCTTCG GGATGGACGA GGGGGGCAAC GAGCGCGCCC AGTTATTCAC ACTCGACGCC GAGACGGGCG AGATCGAGAA CGTAACGGCG ATGCCCGAGG CCAAGCACCG ATGGGGCGGC TGGAGCCACG ACGGCGAGCG GTTCGCCTTC GCCTCCAACC GCCGCGACGA GTCCGTTTTC GACATCTACG TACAGGATCG GGACGAAACG GGCGACGACG CCGACCTCGT CTACGAGGGC GACGGTTGGC TCTCGCTGTC GGGGTGGAGT CCCGACGACT CCCGGCTGCT GGTCTCGCAG GCGTACTCCA ACTTCGACCA GGACCTCTAC GTGCTCGACC TCGAGGACGA CGAGCCCGGC CTCGAGCACC TCACTCCCCA CGAAGGCGAC GTCCGCTATC AGAGCGCCAG CTGGGCCCCG GACGGCGAGG GAATCTATCT CGTCACGGAC GAGGGCGACG CCGACACGCT CTATCTCGCG TATCTCGACC TCGAGACGAA AGCGCTCGAA ACCGTCGCCG ACGGCGACGG ATGGAACGTC GGCGGCATCG CGCTGGACGA CGAGACCGGC CGGTTCGTTT ACTCCCGGAA CGTGGAGGGC TACACAGATC TCACCGTGGG TAAATTCGAC GAGAGCGATC CCACCGAGTT CGAGACGTTT CCCGAACCCG ATCTGCCGGG CGGGATCTCC GGCGGCGTGA GCTTCGATCC CGACGCCGAG CGCTTCGCGC TGTCGACGAC CGGCGACACG GTCAACACGA ACGTTTTCGT GGTCGACGTC GAGAGCGGCG AGACCGAGCA GTGGACGCAC GCACCGACGG CGGGCATCCC CTCTGAATCG TTCGACGAGT CCGACCTCGT CCACGTCGAA AGCTTCGACG GGCTGGAAGT GCCGGGCTTT CTCACGCTCC CTGACGACTA CGAGGAAGGG GATGCAAACG ACGGCGACGG CGTCCCCGTC ATCGTCGACA TCCACGGCGG CCCCGAGAGC CAGCGCCGCC CCTCCTTTTC CTCCGTCAAG CAATATTTCC TCGACCGAGG GTACGCCTAC TTCGAGCCGA ACGTCCGCGG CTCCGCGGGC TACGGCGCCA ACTACGCCGC GCTGGACGAC GTCGAGAAGC GGATGGATTC GGTCGCCGAC ATCGCGACCT GCGTCGAGTG GCTGCAGGAC CACCCCGCCG TCGACCCCGA TCGGATCGCC GCCAAGGGCG GCTCCTACGG CGGCTTCATG GTGCTGGCCG CGCTGACCGA GTACCCCGAC CTCTGGGCGG CCGGCGTCGA CGTCGTCGGT ATCGCCAACT TCGTCACCTT CCTCGAAAAT ACGGGCGACT GGCGCCGCGA ACTCCGCGAG GCCGAGTACG GCTCGCTGGC GGAGGACCGC GAGTTCTTAG AGGAGATCTC GCCGACGAAC AACATCGAGA ACATCGAGGC GCCGCTGTTC GTCCTCCACG GCGCGAACGA CCCGCGCGTC CCCGTCGGCG AGGCCGAACA GATCGCCGAG AAGGCCGAGC AACAGGGCGT CCCCGTCCGA AAGCTGATCT TCGAGGACGA AGGCCACGGC TTCTCGAAAC TCGAGAACCG CATCGAGGCC TACTCTGCGA TCGCAGACTT CCTCGACGAG CACGTCTGA
|
Protein sequence | MGSYEIERYL NIRSAYGTSF GPDGERLSFL MNTTGTPQVW TLEEPRAWPE QRTFYDERVT FASWSPERPE LIFGMDEGGN ERAQLFTLDA ETGEIENVTA MPEAKHRWGG WSHDGERFAF ASNRRDESVF DIYVQDRDET GDDADLVYEG DGWLSLSGWS PDDSRLLVSQ AYSNFDQDLY VLDLEDDEPG LEHLTPHEGD VRYQSASWAP DGEGIYLVTD EGDADTLYLA YLDLETKALE TVADGDGWNV GGIALDDETG RFVYSRNVEG YTDLTVGKFD ESDPTEFETF PEPDLPGGIS GGVSFDPDAE RFALSTTGDT VNTNVFVVDV ESGETEQWTH APTAGIPSES FDESDLVHVE SFDGLEVPGF LTLPDDYEEG DANDGDGVPV IVDIHGGPES QRRPSFSSVK QYFLDRGYAY FEPNVRGSAG YGANYAALDD VEKRMDSVAD IATCVEWLQD HPAVDPDRIA AKGGSYGGFM VLAALTEYPD LWAAGVDVVG IANFVTFLEN TGDWRRELRE AEYGSLAEDR EFLEEISPTN NIENIEAPLF VLHGANDPRV PVGEAEQIAE KAEQQGVPVR KLIFEDEGHG FSKLENRIEA YSAIADFLDE HV
|
| |