Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_0395 |
Symbol | |
ID | 8740962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 423062 |
End bp | 424489 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646510961 |
Product | proteinase inhibitor I4 serpin |
Protein accession | YP_003401968 |
Protein GI | 284163689 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4826] Serine protease inhibitor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCTG ACAGGCGACG CGTGCTGGCC CTGACCGGCG CCCTCCTCGC CGGCGCCGCG GGCTGTCTCG GTGACGTGAG CGACGCGGAG GATCCCGAAA ACGGCGACGA GCCCGCGAAC GGTGCCGAAG ACGGCGCGCT CTTCGACGAC TACGACGTCC CCGACTTTCC CCGACTGGAT CTCACCACTG ATCCCGACCT CGAGTCCGAC CGCCTCGCCG AACAGATCCG CGGAAACGTC GCCGTCTCCT TTGACGTCCT CGCACGGCTT CGCGAGGAGA CGCCCGGCGA GAACCTGTTT TTCTCGCCGT ACAGCATCTC GGTCGCGCTG GCGATGACCT ACGCGGGGGC TCGCGGCGAG ACCGCCGCGG AGATGGCCGA CGCTCTGCGC TACGACCTCG AGGGCGAGGC CCTCCACGCC GCCTTCGGCG CCCTCGAGGG TGAGTTCGAG CAACGAAACG AAGACGGTCG GGACGTCGAG ACGCCAGCGT GGGCCGACGA GGGCGGCGAG GGAGACGGGA GCGAGGGCGA CGAAGACGAC CTCGGATTCC AGCTCTCGAG CGCCAACGCC GTCTGGCGCG ACGAGGGACA CGACTTCGAC GACGCCTACG TCCAGTTACT CGAGGCCTAC TACGAGGCCG GCGACCACCT CGCCGACTTC TCGGGGAGCC CCGAGGCGGC CCGGGAGGAG ATCAACGCCT GGGTCGAGGA GCGAACGAAC GATCGGATCG AAGACCTCCT GCCGGAGGGC TCGATCGATG AGTGGACCCG GCTCGTCCTC ACGAACGCCG TCTACTTCCT GGCCGCCTGG GAGCACGACT TCGATCCCGC CGAGACGGAG CCAGCGACGT TCACGAGCCT CGACGGCAGC GAGACCGAGG TCGACCTGAT GCACCAATCG CAGGAACTGC GCTACGCCGA GATCGACGGC CACCAGCTCG TGGAGCTCCC CTACGCCAAC GGCGACACGA GCATGATCGT CGTCCTCCCA GCCGAGGGCG AGTTCGAGTC CTTCGAGGCG TCGTTCGGCG TCGACGAGCT GGCGATCATG CTCGAGGAGA CGTCGCAGCC AAAGGTCGAC CTCGCGCTCC CGAAGTTCGG CATCGAGTCG AAGTTCAGCC TCGTCGAGAC CATGCGGGAA TTGGGGATGG AACGTGCCTT CGACAACGAT GCGGACTTCA GCGGCATGGT CGAGGACGAC GACAGCGACC TGTACGTCGA CGACATCATC CACCAGAGCT TCGTCGAGGT CGACGAGGAG GGGACCGAAG CGGCGGCCGC GACGGCCGTC GTCATGGAGG ACACCGCCGT CGCGGACCGC GTCAAGATGA CCGTCGATCG GCCGTTCCTC TTCTACGTCC GCGACCGGCC GACCGAGACG CCGCTGTTCG TCGGCCGCGT CGTCGACGGC GAGCAGTTAC AGAACTAG
|
Protein sequence | MTADRRRVLA LTGALLAGAA GCLGDVSDAE DPENGDEPAN GAEDGALFDD YDVPDFPRLD LTTDPDLESD RLAEQIRGNV AVSFDVLARL REETPGENLF FSPYSISVAL AMTYAGARGE TAAEMADALR YDLEGEALHA AFGALEGEFE QRNEDGRDVE TPAWADEGGE GDGSEGDEDD LGFQLSSANA VWRDEGHDFD DAYVQLLEAY YEAGDHLADF SGSPEAAREE INAWVEERTN DRIEDLLPEG SIDEWTRLVL TNAVYFLAAW EHDFDPAETE PATFTSLDGS ETEVDLMHQS QELRYAEIDG HQLVELPYAN GDTSMIVVLP AEGEFESFEA SFGVDELAIM LEETSQPKVD LALPKFGIES KFSLVETMRE LGMERAFDND ADFSGMVEDD DSDLYVDDII HQSFVEVDEE GTEAAAATAV VMEDTAVADR VKMTVDRPFL FYVRDRPTET PLFVGRVVDG EQLQN
|
| |