Gene Huta_1973 details

Gene Information

Locus tag: Huta_1973
Symbol:
ID: 8384267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 1996660
End bp: 1998471
Gene Length: 1812 bp
Protein Length: 603 aa
Translation table: 11
GC content: 65%
IMG OID: 644973043
Product: peptidase M50
Protein accession: YP_003130874
Protein GI: 257053041
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0750] Predicted membrane-associated Zn-dependent proteases 1
TIGRFAM ID:
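
The start, end, gene length and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the CDS length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS whose length includes the stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1996660, 1998471)
print(length, protein_length_aa(length))  # 1812 603
```

Both values agree with the Gene Length and Protein Length fields of this record.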


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.246801
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTACGG TAACGCAATC CTCTTTGCAG GAGCGGTCAT TCCTACTGAC AATGGTCGAT 
ACGCTCACGT TGGTTCTCGC GGGCGTTCTC GCGTACTCTC TGGGGGCGAC GGCGCTCGAT
CGCCGGGGGT ATCTGCCCGC GTTCCTGAAA GTCTCCGGGC CCATCACGAC GCTCCATACC
AAGCGTGGAC GGGCCGCACT CGATTGGCTC GCCCGGCCGA AGCGATTCTG GCGGGCCTTC
GGGAACATCG GCGTCGGGTT CGGGCTATTC ATCCTCGCCG GGATGTTCCT GACAGTCCTG
TTCTCGGGGA TCTCGAGTCT CCAGCAACCG GAAGCGAATC CGATCCAGGA GCCGAAAAAC
GCCCTGGTCA TCCCCGGCCT CAACGACTTC CTCCCGCTTG CGGCCGCGCC GGAGATCATC
TTCGGACTGC TCGTCGGGAT GATCGTCCAC GAGGGGGGCC ACGGCCTGCT CTGCCGGGTC
GAGGACATCG ACATCGACTC GATGGGCGTT GCCCTCTTTA CGATCATCCC GCTGGGGGCG
TTCGTCGAAC CCGACGAGGA AAGCCGGGCG AAAGCCGATC GCGGCGCGCA AACGCGAATG
TTCGCGGCGG GAGTCACCAA CAACTTCGTG ATCACGGCAC TCGCCTTCCT CCTGCTCTTT
GGCCCGGTCG CGGGGTCGAT CCAGGCCGTC GGCGGCGTCG CTGTCGGCGG TGCACTCCCT
GGATCACCGG CCGCCGACGC GTCCCTCGGG GAAGGTGACG TCATCACGGG GATCAACGGG
ACCGAGGTGA CCAACCAGTC GACGCTACGT GACGCACTCG GGGACGCCGA CGGGCGGACC
GTCGCGGTGT CTCTCCACGA GGACGAAACG AAACGCATCC AGCGATCGGT GTTCGTCACC
GTGGCCGTCC TCGACGGGCC GCTCGGGATC GATCGCGGTG ACACGATCAC GAGTGTCAAC
GGAACGGCCG TCCACACGGT CAGCGGGCTG GTCGATGCCG TCGAGAACCG GACGGTCGCC
ACACTCGAAA CGGCCGACGG AAATCAGACG ACCGGGCCGA TCGGCGCGTA CGTCAGCCGC
GTCGCCGAGG GCGGGCCGTT CGCCGACGAC GGCGGGCCGG CCGGCGAGTC AGTCGTCATA
ACGCGCTTCG ACGGAACGCG GATCATCGGA CAGTCACAGC TGCTCGACGC ACTCGAGGGG
ACCGACCCTG GCGAGACGGT CGACATCGAA GCGTACGTCG ACGGCGAGCG CCGGACGTAC
AGTGTCACGC TCGAGGAGAA CCCGCGCGAC GGGACGGGGT TTCTCGGAGT CGTCGGCATC
CAGCCCGGGA TCAGCGGGAT CGTCGTCAAC GACTTCGGTA TCCAGTCCTA TCCCGCCGAA
ACATATCTCG GTATCCTCGG CGGCAACGGT GATCTGGACA TCCCGCTGGG TCAGCAGATC
ATCCTGCTGA TCACACTCCC CCTGGCGAGC GTGGCAGCCC CCGGACTCAC GTTCAACTTC
GCGGGCTTTC TCGGCCCGAT CACCGACTTC TATACGGTCA CGGGGCCGCT CGCCGGTCTC
GGTGGCGGGG TGTTCGTCGT CGCGAACCTG CTGTTCTGGA CCGCATGGGT CAATCTCAAT
CTCGCCGTCT TCAATCTGAT CCCGCTGTTC CCTCTGGACG GGGGTCATTT ACTCCGGACA
GGGACGGAGT CGATCGTCGC CCGGACGCCC GTGAACAAAC GCTGGGCAGT TCGGACCGTG
ACTGTCTCGG TCGGGCTGGT GATGTTCGGA AGCCTCATGC TGATGCTGTT CGGCCCACAG
TTGCTAACCT GA
 
Protein sequence
MGTVTQSSLQ ERSFLLTMVD TLTLVLAGVL AYSLGATALD RRGYLPAFLK VSGPITTLHT 
KRGRAALDWL ARPKRFWRAF GNIGVGFGLF ILAGMFLTVL FSGISSLQQP EANPIQEPKN
ALVIPGLNDF LPLAAAPEII FGLLVGMIVH EGGHGLLCRV EDIDIDSMGV ALFTIIPLGA
FVEPDEESRA KADRGAQTRM FAAGVTNNFV ITALAFLLLF GPVAGSIQAV GGVAVGGALP
GSPAADASLG EGDVITGING TEVTNQSTLR DALGDADGRT VAVSLHEDET KRIQRSVFVT
VAVLDGPLGI DRGDTITSVN GTAVHTVSGL VDAVENRTVA TLETADGNQT TGPIGAYVSR
VAEGGPFADD GGPAGESVVI TRFDGTRIIG QSQLLDALEG TDPGETVDIE AYVDGERRTY
SVTLEENPRD GTGFLGVVGI QPGISGIVVN DFGIQSYPAE TYLGILGGNG DLDIPLGQQI
ILLITLPLAS VAAPGLTFNF AGFLGPITDF YTVTGPLAGL GGGVFVVANL LFWTAWVNLN
LAVFNLIPLF PLDGGHLLRT GTESIVARTP VNKRWAVRTV TVSVGLVMFG SLMLMLFGPQ
LLT
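
The protein sequence corresponds to a translation of the gene sequence. The sketch below shows one way to check this for a fragment of the record. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in which start codons it allows, and that does not matter here because the gene begins with ATG). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence from this record.
first_line = "ATGGGTACGG TAACGCAATC CTCTTTGCAG GAGCGGTCAT TCCTACTGAC AATGGTCGAT"
last_line = "TTGCTAACCT GA"
print(translate(first_line))  # MGTVTQSSLQERSFLLTMVD
print(translate(last_line))   # LLT
```

The two outputs match the first 20 residues and the last 3 residues of the protein sequence shown above.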