Gene Information |
Locus tag | Huta_1973 |
Symbol | |
ID | 8384267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1996660 |
End bp | 1998471 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644973043 |
Product | peptidase M50 |
Protein accession | YP_003130874 |
Protein GI | 257053041 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0750] Predicted membrane-associated Zn-dependent proteases 1 |
TIGRFAM ID | |
|
|
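The coordinate fields above are consistent with each other, and the check can be sketched in a few lines. This is only an illustration under two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based inclusive positions on the replicon, and that the CDS length includes the stop codon. The function names (gene_length, protein_length, extract_gene) are hypothetical helpers, not part of any database API.

```python
# Assumption: coordinates are 1-based and inclusive, as the record's own
# numbers suggest (1998471 - 1996660 + 1 = 1812).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def extract_gene(replicon: str, start_bp: int, end_bp: int, strand: str) -> str:
    """Slice a feature out of a replicon sequence.

    For a '-' strand feature the slice is reverse-complemented so the
    result reads 5'->3' in the direction of the gene.
    """
    segment = replicon[start_bp - 1:end_bp]
    if strand == "-":
        segment = segment.translate(_COMPLEMENT)[::-1]
    return segment


if __name__ == "__main__":
    length = gene_length(1996660, 1998471)
    print(length, protein_length(length))
```

With the values from this record, the two length helpers reproduce the listed Gene Length (1812 bp) and Protein Length (603 aa). The extract_gene helper would need the NC_013158 sequence, which is not included here.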
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.246801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTACGG TAACGCAATC CTCTTTGCAG GAGCGGTCAT TCCTACTGAC AATGGTCGAT ACGCTCACGT TGGTTCTCGC GGGCGTTCTC GCGTACTCTC TGGGGGCGAC GGCGCTCGAT CGCCGGGGGT ATCTGCCCGC GTTCCTGAAA GTCTCCGGGC CCATCACGAC GCTCCATACC AAGCGTGGAC GGGCCGCACT CGATTGGCTC GCCCGGCCGA AGCGATTCTG GCGGGCCTTC GGGAACATCG GCGTCGGGTT CGGGCTATTC ATCCTCGCCG GGATGTTCCT GACAGTCCTG TTCTCGGGGA TCTCGAGTCT CCAGCAACCG GAAGCGAATC CGATCCAGGA GCCGAAAAAC GCCCTGGTCA TCCCCGGCCT CAACGACTTC CTCCCGCTTG CGGCCGCGCC GGAGATCATC TTCGGACTGC TCGTCGGGAT GATCGTCCAC GAGGGGGGCC ACGGCCTGCT CTGCCGGGTC GAGGACATCG ACATCGACTC GATGGGCGTT GCCCTCTTTA CGATCATCCC GCTGGGGGCG TTCGTCGAAC CCGACGAGGA AAGCCGGGCG AAAGCCGATC GCGGCGCGCA AACGCGAATG TTCGCGGCGG GAGTCACCAA CAACTTCGTG ATCACGGCAC TCGCCTTCCT CCTGCTCTTT GGCCCGGTCG CGGGGTCGAT CCAGGCCGTC GGCGGCGTCG CTGTCGGCGG TGCACTCCCT GGATCACCGG CCGCCGACGC GTCCCTCGGG GAAGGTGACG TCATCACGGG GATCAACGGG ACCGAGGTGA CCAACCAGTC GACGCTACGT GACGCACTCG GGGACGCCGA CGGGCGGACC GTCGCGGTGT CTCTCCACGA GGACGAAACG AAACGCATCC AGCGATCGGT GTTCGTCACC GTGGCCGTCC TCGACGGGCC GCTCGGGATC GATCGCGGTG ACACGATCAC GAGTGTCAAC GGAACGGCCG TCCACACGGT CAGCGGGCTG GTCGATGCCG TCGAGAACCG GACGGTCGCC ACACTCGAAA CGGCCGACGG AAATCAGACG ACCGGGCCGA TCGGCGCGTA CGTCAGCCGC GTCGCCGAGG GCGGGCCGTT CGCCGACGAC GGCGGGCCGG CCGGCGAGTC AGTCGTCATA ACGCGCTTCG ACGGAACGCG GATCATCGGA CAGTCACAGC TGCTCGACGC ACTCGAGGGG ACCGACCCTG GCGAGACGGT CGACATCGAA GCGTACGTCG ACGGCGAGCG CCGGACGTAC AGTGTCACGC TCGAGGAGAA CCCGCGCGAC GGGACGGGGT TTCTCGGAGT CGTCGGCATC CAGCCCGGGA TCAGCGGGAT CGTCGTCAAC GACTTCGGTA TCCAGTCCTA TCCCGCCGAA ACATATCTCG GTATCCTCGG CGGCAACGGT GATCTGGACA TCCCGCTGGG TCAGCAGATC ATCCTGCTGA TCACACTCCC CCTGGCGAGC GTGGCAGCCC CCGGACTCAC GTTCAACTTC GCGGGCTTTC TCGGCCCGAT CACCGACTTC TATACGGTCA CGGGGCCGCT CGCCGGTCTC GGTGGCGGGG TGTTCGTCGT CGCGAACCTG CTGTTCTGGA CCGCATGGGT CAATCTCAAT CTCGCCGTCT TCAATCTGAT CCCGCTGTTC CCTCTGGACG GGGGTCATTT ACTCCGGACA GGGACGGAGT CGATCGTCGC CCGGACGCCC GTGAACAAAC GCTGGGCAGT TCGGACCGTG ACTGTCTCGG TCGGGCTGGT GATGTTCGGA AGCCTCATGC TGATGCTGTT CGGCCCACAG 
TTGCTAACCT GA
|
Protein sequence | MGTVTQSSLQ ERSFLLTMVD TLTLVLAGVL AYSLGATALD RRGYLPAFLK VSGPITTLHT KRGRAALDWL ARPKRFWRAF GNIGVGFGLF ILAGMFLTVL FSGISSLQQP EANPIQEPKN ALVIPGLNDF LPLAAAPEII FGLLVGMIVH EGGHGLLCRV EDIDIDSMGV ALFTIIPLGA FVEPDEESRA KADRGAQTRM FAAGVTNNFV ITALAFLLLF GPVAGSIQAV GGVAVGGALP GSPAADASLG EGDVITGING TEVTNQSTLR DALGDADGRT VAVSLHEDET KRIQRSVFVT VAVLDGPLGI DRGDTITSVN GTAVHTVSGL VDAVENRTVA TLETADGNQT TGPIGAYVSR VAEGGPFADD GGPAGESVVI TRFDGTRIIG QSQLLDALEG TDPGETVDIE AYVDGERRTY SVTLEENPRD GTGFLGVVGI QPGISGIVVN DFGIQSYPAE TYLGILGGNG DLDIPLGQQI ILLITLPLAS VAAPGLTFNF AGFLGPITDF YTVTGPLAGL GGGVFVVANL LFWTAWVNLN LAVFNLIPLF PLDGGHLLRT GTESIVARTP VNKRWAVRTV TVSVGLVMFG SLMLMLFGPQ LLT
|
| |
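The relationship between the Gene sequence, the Protein sequence, and the GC content field can be checked with a short script. The sketch below is a minimal illustration, not the database's own method: the helper names (clean, gc_fraction, translate) are hypothetical, and only the first and last few codons of the gene sequence above are used as sample input. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons).

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order. Translation
# table 11 uses the same codon-to-amino-acid assignments as the standard code.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a nucleotide sequence codon by codon; '*' marks a stop codon."""
    seq = clean(seq)
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


if __name__ == "__main__":
    # First 30 nt and last 15 nt of the gene sequence listed above.
    print(translate("ATGGGTACGG TAACGCAATC CTCTTTGCAG"))
    print(translate("CAG TTGCTAACCT GA"))
```

The first sample translates to MGTVTQSSLQ, the opening block of the Protein sequence, and the second to QLLT followed by a stop, matching its end. Running gc_fraction on the full gene sequence would be the way to compare against the 65% GC content listed in the record.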