Gene Information |
Locus tag | Htur_5221 |
Symbol | |
ID | 8745769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | + |
Start bp | 116080 |
End bp | 117273 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646515578 |
Product | peptidase M24 |
Protein accession | YP_003406525 |
Protein GI | 284176248 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0493445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATCCC CGTTCAAACG CCGTCTCGAG GCGTGTCAAC GCCGACTCGA GCGCACCGAC GCCGCGCTGG CCGTCCTCGT TCCGGGCCCG AATCTCACCT ACCTGACCGG CTTCGAGGAA TCACCGTCGG AGCGACACCT GCTGCTGTTC GTCCCGCAGG TCGGCGATCC GGTCGTCGTC GCGCCGGCGA TGTACGACGC CCAGCTCCGG ACGCTGCCGA TCGAAACGCT CGCGGTACGG CTGTGGGACG ACGATGACGA CCCGCTCGAG GAAATCGAGG CGGTGCTGGC GGAGCTACTG CCGGCGGACG ACGAGCGATC CGGCTCGTCT CGGGACGACG CGCCGACGAT TCTCGTCGAC GACCGCATGT GGGCGACGTT CACACAGGAC CTGCGGGAGT GCGCGCCGGC GGCGACGTTC GACCTCGCCA GTCGTGTCCT CGAGGACCTC CGGATCCGGA AGGACGACGT CGAACTCGCG GCGCTCCGGC GGGCCGGCGA GATCGCGGAC CGCGTCTCGC TCGAGATCCG TTCCCGTGGA ACCGAGCTGG TCGGGCGAAC GGAAGCGGAA CTGGCGAACG AGATCGAGGG GCTGCTCGCC GAGTACGGCG GCGGCGAGCC GGCGTTCGAG ACGATCGTTG CGTCGGGACC CAACGGCGCC CGACCCCACC ACCACAGCGG CGACCGGGAG ATCGAGCGCG GCGACCCGAT CGTCCTGGAT TTCGGGGCGT TCGTCGACGC CGACCTCGAG GACGGGACGG GCCGCTATCC CGGCGACCAG ACGCGGACGA TCGTCGTCGG CGACGAGCCC GCAGACGAGT ACGAGCGGTA CGATCGGGTA CACGAGGTCG TCCGCGAGGC CCAGCAGCTC GCCGTCGAGA CCGTCGAACC CGGCGTGACG GCCGGCGCGG TCGATCGGGC GGCGCGATCG GTCATCGAGG ACGCCGGCTA CGGCGACGAA TTCGTCCACC GAACCGGTCA CGGCGTCGGC CTCGAGGTCC ACGAACCGCC CTACATCGTC GCGGACAACG ACCGCGAACT CGAGCCCGGG ATGGTCTTCT CCGTCGAACC GGGGATCTAC CTCGAGGGCG AGTTCGGCGT CCGGATCGAA GACCTGGTCG TCGTCACCGA GGACGGCGCT GAGCGGCTGA ACGAGTCGTC CCGCGGGTGG GAGACCGGCG AAATCCACTC GTAG
|
Protein sequence | MRSPFKRRLE ACQRRLERTD AALAVLVPGP NLTYLTGFEE SPSERHLLLF VPQVGDPVVV APAMYDAQLR TLPIETLAVR LWDDDDDPLE EIEAVLAELL PADDERSGSS RDDAPTILVD DRMWATFTQD LRECAPAATF DLASRVLEDL RIRKDDVELA ALRRAGEIAD RVSLEIRSRG TELVGRTEAE LANEIEGLLA EYGGGEPAFE TIVASGPNGA RPHHHSGDRE IERGDPIVLD FGAFVDADLE DGTGRYPGDQ TRTIVVGDEP ADEYERYDRV HEVVREAQQL AVETVEPGVT AGAVDRAARS VIEDAGYGDE FVHRTGHGVG LEVHEPPYIV ADNDRELEPG MVFSVEPGIY LEGEFGVRIE DLVVVTEDGA ERLNESSRGW ETGEIHS
|
| |
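The numeric fields in the Gene Information table are related by simple arithmetic, which can serve as a quick consistency check on the record. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon (the gene is listed as unspliced, and the sequence above ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Htur_5221.
start_bp, end_bp = 116080, 117273
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1194 397
```

Both results agree with the Gene Length (1194 bp) and Protein Length (397 aa) fields listed in the record.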