Gene Htur_5221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_5221 
Symbol 
ID8745769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013747 
Strand
Start bp116080 
End bp117273 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content70% 
IMG OID646515578 
Productpeptidase M24 
Protein accessionYP_003406525 
Protein GI284176248 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0493445 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATCCC CGTTCAAACG CCGTCTCGAG GCGTGTCAAC GCCGACTCGA GCGCACCGAC 
GCCGCGCTGG CCGTCCTCGT TCCGGGCCCG AATCTCACCT ACCTGACCGG CTTCGAGGAA
TCACCGTCGG AGCGACACCT GCTGCTGTTC GTCCCGCAGG TCGGCGATCC GGTCGTCGTC
GCGCCGGCGA TGTACGACGC CCAGCTCCGG ACGCTGCCGA TCGAAACGCT CGCGGTACGG
CTGTGGGACG ACGATGACGA CCCGCTCGAG GAAATCGAGG CGGTGCTGGC GGAGCTACTG
CCGGCGGACG ACGAGCGATC CGGCTCGTCT CGGGACGACG CGCCGACGAT TCTCGTCGAC
GACCGCATGT GGGCGACGTT CACACAGGAC CTGCGGGAGT GCGCGCCGGC GGCGACGTTC
GACCTCGCCA GTCGTGTCCT CGAGGACCTC CGGATCCGGA AGGACGACGT CGAACTCGCG
GCGCTCCGGC GGGCCGGCGA GATCGCGGAC CGCGTCTCGC TCGAGATCCG TTCCCGTGGA
ACCGAGCTGG TCGGGCGAAC GGAAGCGGAA CTGGCGAACG AGATCGAGGG GCTGCTCGCC
GAGTACGGCG GCGGCGAGCC GGCGTTCGAG ACGATCGTTG CGTCGGGACC CAACGGCGCC
CGACCCCACC ACCACAGCGG CGACCGGGAG ATCGAGCGCG GCGACCCGAT CGTCCTGGAT
TTCGGGGCGT TCGTCGACGC CGACCTCGAG GACGGGACGG GCCGCTATCC CGGCGACCAG
ACGCGGACGA TCGTCGTCGG CGACGAGCCC GCAGACGAGT ACGAGCGGTA CGATCGGGTA
CACGAGGTCG TCCGCGAGGC CCAGCAGCTC GCCGTCGAGA CCGTCGAACC CGGCGTGACG
GCCGGCGCGG TCGATCGGGC GGCGCGATCG GTCATCGAGG ACGCCGGCTA CGGCGACGAA
TTCGTCCACC GAACCGGTCA CGGCGTCGGC CTCGAGGTCC ACGAACCGCC CTACATCGTC
GCGGACAACG ACCGCGAACT CGAGCCCGGG ATGGTCTTCT CCGTCGAACC GGGGATCTAC
CTCGAGGGCG AGTTCGGCGT CCGGATCGAA GACCTGGTCG TCGTCACCGA GGACGGCGCT
GAGCGGCTGA ACGAGTCGTC CCGCGGGTGG GAGACCGGCG AAATCCACTC GTAG
 
Protein sequence
MRSPFKRRLE ACQRRLERTD AALAVLVPGP NLTYLTGFEE SPSERHLLLF VPQVGDPVVV 
APAMYDAQLR TLPIETLAVR LWDDDDDPLE EIEAVLAELL PADDERSGSS RDDAPTILVD
DRMWATFTQD LRECAPAATF DLASRVLEDL RIRKDDVELA ALRRAGEIAD RVSLEIRSRG
TELVGRTEAE LANEIEGLLA EYGGGEPAFE TIVASGPNGA RPHHHSGDRE IERGDPIVLD
FGAFVDADLE DGTGRYPGDQ TRTIVVGDEP ADEYERYDRV HEVVREAQQL AVETVEPGVT
AGAVDRAARS VIEDAGYGDE FVHRTGHGVG LEVHEPPYIV ADNDRELEPG MVFSVEPGIY
LEGEFGVRIE DLVVVTEDGA ERLNESSRGW ETGEIHS