Gene Huta_2284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2284 
Symbol 
ID8384581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2333304 
End bp2334407 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content69% 
IMG OID644973356 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003131184 
Protein GI257053351 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.135107 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCCAG AGACATCTAC ACGGCGGCGG TTTCTGGCGC TCGGCGGAAC GGCACTCGCG 
TCCGCGATCG CGGGGTGTTC GACGTCGCTG TCGGCACCCG AGTCGTCGCC AGGGGAGTCG
CCGTCGCCCT CCGCAGCCGG AGAGTCTCCC GAGACGGACA CAGCCGAGAG TCCCTACACC
CGGGTGTACC GCGAGACCAT CCCCTCGGTC ACGACCGTCC AGACGACGAC CGGCCAGGGA
ACCGGCTTCC AGTACGACGA GGGGCACGTC GTCACGAACG CCCACGTCGT CGGGACGGCG
AGTGAGGCGC AGGTCCGCTT CCACGACGGG ACGTGGGCGA GTGGCCCGGT GGTCGGGACG
GATCCACACA GCGACCTCGC CGTGATCGAA CCCGGGACAG TGCCCGATTC CGCAGCGCCG
CTCCCGTTCG GCGACCAGCC GCCGACGATC GGCCGGGAGG TCGTCGTCAT CGGGAACCCG
TACAACCTCG ACGGGTCGGT CACGTCGGGG ATCGTCAGCG GCACCGACCG GCTGATCCCG
TCGCCGGCGG GCTATCAGAT TCCGGACGCG ATCCAGACCG ACGCGGCGGT GAACCCGGGC
AACAGCGGCG GCCCGCTGAT GGATCTCGAC GGTTCCGTGG TGGGCGTCGT GAATTCGAAG
CAGGGCGACA GCATCGCCTT CGGGATCTCG GCTGCGCTGA CGCAGCGCGT GGTCCCGGAA
CTCATCGAGT CGGGAACCTA CGAGCACGCC TACATGGGCG TCTCACTCGA CGTGGTCACG
CCGCGGCTCG CAGAGGCGAA CGACCTGGCG GAGCCACAGG GCCTGCTGGT CGTCCAGACC
GTTCGTGGTG GCCCGGCTGA CAGCGTCCTC CAGCCCAGTA GCATCGAGTA CGTCGGTGGA
CGCCAGGTCC CCGTGGGTGG CGACGTTATC CGCGCCATCG ACGGGACGCC GATGGAGACC
TTCGAGGACC TGGCGAGCTA TCTGGCGCTG GAAACCCGGC CCGGTGATAC CATCGAGGTC
GCGATCCTGC GGGACGGTGA CGAGCGAACC GTCGAGGTGA CGCTCGATGC CCGCCCGGAC
CGCTCGACGT CACCGCTCCG TTGA
 
Protein sequence
MTPETSTRRR FLALGGTALA SAIAGCSTSL SAPESSPGES PSPSAAGESP ETDTAESPYT 
RVYRETIPSV TTVQTTTGQG TGFQYDEGHV VTNAHVVGTA SEAQVRFHDG TWASGPVVGT
DPHSDLAVIE PGTVPDSAAP LPFGDQPPTI GREVVVIGNP YNLDGSVTSG IVSGTDRLIP
SPAGYQIPDA IQTDAAVNPG NSGGPLMDLD GSVVGVVNSK QGDSIAFGIS AALTQRVVPE
LIESGTYEHA YMGVSLDVVT PRLAEANDLA EPQGLLVVQT VRGGPADSVL QPSSIEYVGG
RQVPVGGDVI RAIDGTPMET FEDLASYLAL ETRPGDTIEV AILRDGDERT VEVTLDARPD
RSTSPLR