Gene Huta_1350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1350 
Symbol 
ID8383627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1322165 
End bp1323430 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content63% 
IMG OID644972411 
ProductSte24 endopeptidase 
Protein accessionYP_003130259 
Protein GI257052426 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0501] Zn-dependent protease with chaperone function 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.276793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGTCT ATCATGCGCT CTTCGTCACG TTACTTGCTG GTACGACCGG CTTCTTTACC 
GCGCTCGCCG CGTTGAACGT CCGGCACGCT GAACGGACCG TCGACGAGGA GGGAGATTTC
GTGACTGATC GTCTCGGAAT CGACGATCCC GAGGAATTGC TCGCGTACAA CCGACTGGGA
ACGGCGCTGG GCCACCTCCA GACCTGGATG ATGCTCGTCG TCGTGTTGCT GGTACTGTAC
TCGGGCCTCT ATGCCGATGC CGTTGCGGCC CTGGAGGCGA CTGGCTGGCC ACCCTTCGTT
CGCGGGACTG TCTTCGTCGT TGGTACCGTC CTCGCGCTCC AGGCGCTCTC GCTCCCGTTC
GACGTCGTCG AGACGTTCGT CGTCGAGGAC CTCTTCGACT TCAATCAACA GACGCTACGG
CTGTACATCC GGGATCAACT CGTCTCGCTT CTCGTGATGG TGGTGCTGGT CGGCGTCCTC
GCTACGGCGG TGTTTCTCGC CATGGATGCT CTCGGCGAGT TGTGGTGGGT CGCCGCCTGG
GCGTTGTTCG TCGGCTTTTC CTTGCTCATG CAGGTCCTGT ACCCACGCGT GATCGCGCCG
CTGTTCAACG ACTTCGACCC GATCGAGTCC GGCGATCTCC ACGACGCAGT GACTGACGTC
TTCGATCGCG CGGGCTTCGA CACCGACGCG ATCTACGAGA TGGACGCCAG CCGTCGATCG
TCCCACGCCA ACGCCTACTT CATCGGCTTC GGTCGGACCA AGCGAGTCGT GCTGTTCGAC
ACCCTGATCG AGCAGTTGTC GATCCCCTCG GTGCAGGCAG TCCTCGCCCA CGAACTCGCC
CACTACGACC GTGGGCACAT CTGGAAGCAA CTCGGTGCGA GCGCCCTCTG GATGGGTGCC
CTGCTGTTCG GCGCGTCGCT GTTGGTCGAG GCCACGTGGC TCTATGAGAT GTTCGGCATT
GCCGGCCAGC CGGTCTACGC CGGTCTCGTG TTGGCCGTGC TGTGGCTCGT GCCCGTCGCC
CAGCTTTCGG CGCCGCTCAC CAACCGATTG TCACTCGCCC ACGAACGCGA GGCCGACGCC
TTCGCCGTCG AGGTGATGGG TGCCGAGCCG ATGGCCGACG CACTCGCCGA TCTGACGAGC
GAGAACCTCT CGAACCCGTT CCCGCACCCG CTCTATGAAA CCTTCCACTA CGATCACCCA
CCCGTGCCGA AACGGCTGCA GCACATCCGA ACGCTGGCCG ACGAATCCGC AAAAGCTGAG
GCTTAA
 
Protein sequence
MLVYHALFVT LLAGTTGFFT ALAALNVRHA ERTVDEEGDF VTDRLGIDDP EELLAYNRLG 
TALGHLQTWM MLVVVLLVLY SGLYADAVAA LEATGWPPFV RGTVFVVGTV LALQALSLPF
DVVETFVVED LFDFNQQTLR LYIRDQLVSL LVMVVLVGVL ATAVFLAMDA LGELWWVAAW
ALFVGFSLLM QVLYPRVIAP LFNDFDPIES GDLHDAVTDV FDRAGFDTDA IYEMDASRRS
SHANAYFIGF GRTKRVVLFD TLIEQLSIPS VQAVLAHELA HYDRGHIWKQ LGASALWMGA
LLFGASLLVE ATWLYEMFGI AGQPVYAGLV LAVLWLVPVA QLSAPLTNRL SLAHEREADA
FAVEVMGAEP MADALADLTS ENLSNPFPHP LYETFHYDHP PVPKRLQHIR TLADESAKAE
A