Gene Htur_0739 details


Gene Information

Locus tag: Htur_0739
Symbol:
ID: 8741322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013743
Strand:
Start bp: 760051
End bp: 761166
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 69%
IMG OID: 646511318
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_003402309
Protein GI: 284164030
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
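
The start and end coordinates, gene length, and protein length in this record are consistent with one another, and the relationship can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is an uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(760051, 761166)
print(length)                  # 1116
print(protein_length(length))  # 371
```

Both values match the Gene Length and Protein Length fields listed for Htur_0739.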


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGGAC ACACCCCGGA TCGTCGCGAG TTCCTCGCGC TCGCCGGTAC CGGCCTCGTC 
GGCGCCGTCG CCGGCTGTAC CGAGCCCACC GCCAGCGACT CGATGGAGGG CTCGTCCTCG
CACTCGATTT CGCAGGAGAT CGATCCCGAC AAGCGCGCGG ACGGCTCGAC GTACACCGAC
GTCTACGAGG CGGTCATCGA CTCGGTCGCA CAGGTGCGGG CGCTAGGCGC CGGGAGCCCG
TACGGTGGCG ACCGGAGCGG CGGCCAGGGC TCGGGCTTTC TCGTCGACGA CACCCACCTC
GTCACGAACG AGCACGTCGT CGCCGGCGCC GACACCGTCG ACCTCCAGTA CATCAACGGC
GACTGGTCGG CCACCAAAAT CGTCGGCGCC GACTTCTACA GCGACCTGGC CGTCCTGAAG
GTCGATCACG TCCCCGACGA GGCGACGCCC CTCGAGTTGG CCGCCGAGCG CTCCGTCGTC
GGTCAGGAGG TGCTCGCGAT CGGCAACCCC TACGGGTTCG AGGGCTCCAT GTCGAAGGGG
ATCGTCAGCG GCGTCAATCG TACGCTCGAC ATGCCGGACC GGACGTTTTC GTTTTCGAAC
GCGATCCAGA CCGATGCCGC GGTCAACCCC GGCAACAGCG GCGGGCCGCT GGTCAACCTG
GACGGCGAAG TCGCTGGCGT CATCACCGCC GGCGGCGGGG ACAACATCGG CTTCGCGATC
CCGTCGGCGG TCGCGAGTCG AGTCGTCCCC TCGCTGATCG AGACCGGAAC CTACGACCAC
TCCTACATGG GGATCACCCT CGCAACCGTC GACCGGTATA TCGCCGAGGC CAACGACCTC
CCCGAGGCGA CCGGCGTCAT CGTCACGGGG GTCGAATCCG GTGACCCGGC CGACGGCGTC
CTCCGGGCCG CAACGCCCCG CCCGCGCGAC TCGATCCCCG TCGGGGGCGA CGTCATCTAC
GCCATCGACG GCGAGCCGAT CCCCGACCGC CACGCGCTCT CGAGCCACCT CGCCTTGCGG
ACCAGTCCGG GGGATACGAT CGAGATCGAG CGCTGGCGCT ACGGCGACGA GACCACGGTC
TCGCTGACGC TCGGGGAGCG ACCGTCGGCC AACTGA
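
The GC content field can in principle be recomputed from a sequence printed in this blocked layout. The following is a minimal Python sketch, assuming GC content is defined as the percentage of G and C bases over all bases once whitespace is removed. The helper names are hypothetical, and the example uses only the first ten bases of the gene sequence, so its result is not the 69% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for layout and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence (assumed definition)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First ten-base block of the gene sequence above.
print(gc_percent("ATGAACGGAC"))  # 50.0
```

Passing the full gene sequence text to `gc_percent` gives the value for the complete gene.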
 
Protein sequence
MNGHTPDRRE FLALAGTGLV GAVAGCTEPT ASDSMEGSSS HSISQEIDPD KRADGSTYTD 
VYEAVIDSVA QVRALGAGSP YGGDRSGGQG SGFLVDDTHL VTNEHVVAGA DTVDLQYING
DWSATKIVGA DFYSDLAVLK VDHVPDEATP LELAAERSVV GQEVLAIGNP YGFEGSMSKG
IVSGVNRTLD MPDRTFSFSN AIQTDAAVNP GNSGGPLVNL DGEVAGVITA GGGDNIGFAI
PSAVASRVVP SLIETGTYDH SYMGITLATV DRYIAEANDL PEATGVIVTG VESGDPADGV
LRAATPRPRD SIPVGGDVIY AIDGEPIPDR HALSSHLALR TSPGDTIEIE RWRYGDETTV
SLTLGERPSA N
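
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal Python sketch of that translation. Translation table 11 uses the same amino acid assignments as the standard genetic code and differs in the start codons it allows; this sketch ignores alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in T, C, A, G order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Layout whitespace is removed first. Codons containing anything other
    than A, C, G, or T raise KeyError in this sketch.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate("ATGAACGGAC ACACCCCGGA TCGTCGCGAG"))  # MNGHTPDRRE
# Last three codons of the gene sequence, ending in the TGA stop codon.
print(translate("GCC AAC TGA"))  # AN
```

The two outputs match the first ten and last two residues of the protein sequence shown above.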