Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2151 |
Symbol | |
ID | 8384445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2201338 |
End bp | 2203239 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644973220 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_003131051 |
Protein GI | 257053218 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.160132 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCTT CTCCGTCAAA ACTGGCGGGC CCAGTGCGAC GGAAGGTGGG GGAAGGGGAG CGAAAACAGA CGGCCGATTC TGACAAAGTA CTGCCGCTGT ACGTCAGAGC GGAACCGGGG AAGGTTTCGG CTGCCAAGCG AGCAATAGAG AAGACCGGCA GGGAGATCCG ATCGGTGGAC GCCGGATACA TCGCGGTCGA CCTGCCGCCG AAAAGTACGC TTACGATCGC GGAATCGGAC GCGGTGCGTC ACATCCAGGA ACGCCACACG CCGCGTGCAC ATCAGGTACC GGAGCGGAAC ATTTCAGAGG GCGTCGGCGT GATGCACGCC GACACCCTCC ACGATGAAGG CGTCACGGGT GACGGCGCCC GCATCGCCGT CATCGACCAC AGGTTCCACA CCGACAACCC GAAATATTCA GATAGAATCG TTGCGACTGT CGGCGATAGT ACGTATTTCA CGTCTGATAG TGAATACAAC GATACAAGTT ACGAAGGGCC AACGGAACAA CACGGAACGG CCTGTGCCGA ACTGGTCGCC GACGTCGCCC CCGACGCGGA ACTGGTTCTC GCGACGACCA TCGGCCCGCA ATCGTTCGGC CAAATCATGA ACGAGATCGA AAGTTACGAT CCGGACGGGG CGACGATGTC GCTGGGGTAT TATACCGGAC TCCGCATCGA CGGCGAGGAC CCGATCAGTT CGCGAATCGA TCAGTTCACT GACGGCGGGC GGCTGTTTGC GAACTCCGCC GGCAACGAGG CGAACGCACA CTGGGACGGA CAGTTCGAGA ACGACGGCAA CGATCTGATG GTCTTCGACA GTTCGCTGTC GACGCCCACA CGGTTCCCCG TGGAAATGCC CTATTCGGGC AGCGAGATCC ACGTCCACTG GGATGCCGAC TGGAGCCAGG ACGACCAGCG CTACAAAGTC CGGGTGTACG ACAACGAAGA CGATCCAGTC GGTGATAGCT CGGCACTCCT CACTGAACAG ACGACCGATC CCGTCGAGAT CATCTCGGCA CCGTCGTCGG GAAGTAACCC GTACCATCTC GAAATCGAGA AGGTCGACGC CACCGGCGAC GAGCACTTCG ACATGTTCAC CTGGTACTCG TCGCTCGGTC GCACAACCGC GCGTCGAAGC ATCGGGATCC CGGCGACGAG TCCCGACGAA AATCTGCTTT CCGTGGCGGC CGTACAGGCA ACGGAGTACG GTCGGACCAG CGAGGAACAT CTCAAGCCCT ACTCCTCGCA GGGGCCGACA CAGGACGGGC GACGGGGGAT CGACATCGCC GCACCGTCGA TGGTTTCGAC GACCGACGAG GGTGAATACG GCGCGTATGG CGCACTCGAA GACGGTGGCG GCTTCAACGG GACATCGGCA GCCTCACCAC ATGTCGGCGG GGCGTTCGGG TTGCTGTTCG GCTCGGCGAT CAGCGCCAGT CCAGTCCAAG CGCGTGACGC ACTGTTCGAT ACGGGACGGT CGATCGTCGA TTCCGATGTC GCCGAGCCCG GCGAGAACAA CACCAAGATC GGTCACGGAT ACACCGACGT CGCGGCCGCT CAGGAGTGGT CGACGTCGAT TCATGCGACC GGCGACGTGA TCTCGCCCGG GGAACGCGCG ACGATAACAG CCGCGGGAAG CGACATCGAG AACATTACCG TAGCGGACCT CTGGACGGAC TGGTCGGTCG ACTCGACGCA ACCCGACGGC GGGACCTTCA GTGACGACGT TGCGTCCGCC GGGACGGGGT CATTCTCGTG GGATTCGACG CAGTCGTCGG TCTCCGTGTC GCTGACTGTC GACGTGCCGA GTCGCTACGT CGGCGGTACG TATGTGGTGG ACGTGATCGG TCAGAAATCC GGATCGCCCG TCGAGAAGAC GGTCCAGATC GATATCTCCT GA
|
Protein sequence | MSSSPSKLAG PVRRKVGEGE RKQTADSDKV LPLYVRAEPG KVSAAKRAIE KTGREIRSVD AGYIAVDLPP KSTLTIAESD AVRHIQERHT PRAHQVPERN ISEGVGVMHA DTLHDEGVTG DGARIAVIDH RFHTDNPKYS DRIVATVGDS TYFTSDSEYN DTSYEGPTEQ HGTACAELVA DVAPDAELVL ATTIGPQSFG QIMNEIESYD PDGATMSLGY YTGLRIDGED PISSRIDQFT DGGRLFANSA GNEANAHWDG QFENDGNDLM VFDSSLSTPT RFPVEMPYSG SEIHVHWDAD WSQDDQRYKV RVYDNEDDPV GDSSALLTEQ TTDPVEIISA PSSGSNPYHL EIEKVDATGD EHFDMFTWYS SLGRTTARRS IGIPATSPDE NLLSVAAVQA TEYGRTSEEH LKPYSSQGPT QDGRRGIDIA APSMVSTTDE GEYGAYGALE DGGGFNGTSA ASPHVGGAFG LLFGSAISAS PVQARDALFD TGRSIVDSDV AEPGENNTKI GHGYTDVAAA QEWSTSIHAT GDVISPGERA TITAAGSDIE NITVADLWTD WSVDSTQPDG GTFSDDVASA GTGSFSWDST QSSVSVSLTV DVPSRYVGGT YVVDVIGQKS GSPVEKTVQI DIS
|
| |