Gene Huta_2151 details

Gene Information

Locus tag: Huta_2151
Symbol:
ID: 8384445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2201338
End bp: 2203239
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 62%
IMG OID: 644973220
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003131051
Protein GI: 257053218
COG category:
COG ID:
TIGRFAM ID:
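
As a quick consistency check on the coordinates in this record, the following minimal Python sketch recomputes the gene and protein lengths from the start and end positions. It assumes 1-based inclusive coordinates and a single untranslated terminal stop codon, neither of which the record states explicitly; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2201338
end_bp = 2203239

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1902
print(protein_length)  # 633
```

Both values agree with the Gene Length and Protein Length fields of the record.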


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.160132
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTCTT CTCCGTCAAA ACTGGCGGGC CCAGTGCGAC GGAAGGTGGG GGAAGGGGAG 
CGAAAACAGA CGGCCGATTC TGACAAAGTA CTGCCGCTGT ACGTCAGAGC GGAACCGGGG
AAGGTTTCGG CTGCCAAGCG AGCAATAGAG AAGACCGGCA GGGAGATCCG ATCGGTGGAC
GCCGGATACA TCGCGGTCGA CCTGCCGCCG AAAAGTACGC TTACGATCGC GGAATCGGAC
GCGGTGCGTC ACATCCAGGA ACGCCACACG CCGCGTGCAC ATCAGGTACC GGAGCGGAAC
ATTTCAGAGG GCGTCGGCGT GATGCACGCC GACACCCTCC ACGATGAAGG CGTCACGGGT
GACGGCGCCC GCATCGCCGT CATCGACCAC AGGTTCCACA CCGACAACCC GAAATATTCA
GATAGAATCG TTGCGACTGT CGGCGATAGT ACGTATTTCA CGTCTGATAG TGAATACAAC
GATACAAGTT ACGAAGGGCC AACGGAACAA CACGGAACGG CCTGTGCCGA ACTGGTCGCC
GACGTCGCCC CCGACGCGGA ACTGGTTCTC GCGACGACCA TCGGCCCGCA ATCGTTCGGC
CAAATCATGA ACGAGATCGA AAGTTACGAT CCGGACGGGG CGACGATGTC GCTGGGGTAT
TATACCGGAC TCCGCATCGA CGGCGAGGAC CCGATCAGTT CGCGAATCGA TCAGTTCACT
GACGGCGGGC GGCTGTTTGC GAACTCCGCC GGCAACGAGG CGAACGCACA CTGGGACGGA
CAGTTCGAGA ACGACGGCAA CGATCTGATG GTCTTCGACA GTTCGCTGTC GACGCCCACA
CGGTTCCCCG TGGAAATGCC CTATTCGGGC AGCGAGATCC ACGTCCACTG GGATGCCGAC
TGGAGCCAGG ACGACCAGCG CTACAAAGTC CGGGTGTACG ACAACGAAGA CGATCCAGTC
GGTGATAGCT CGGCACTCCT CACTGAACAG ACGACCGATC CCGTCGAGAT CATCTCGGCA
CCGTCGTCGG GAAGTAACCC GTACCATCTC GAAATCGAGA AGGTCGACGC CACCGGCGAC
GAGCACTTCG ACATGTTCAC CTGGTACTCG TCGCTCGGTC GCACAACCGC GCGTCGAAGC
ATCGGGATCC CGGCGACGAG TCCCGACGAA AATCTGCTTT CCGTGGCGGC CGTACAGGCA
ACGGAGTACG GTCGGACCAG CGAGGAACAT CTCAAGCCCT ACTCCTCGCA GGGGCCGACA
CAGGACGGGC GACGGGGGAT CGACATCGCC GCACCGTCGA TGGTTTCGAC GACCGACGAG
GGTGAATACG GCGCGTATGG CGCACTCGAA GACGGTGGCG GCTTCAACGG GACATCGGCA
GCCTCACCAC ATGTCGGCGG GGCGTTCGGG TTGCTGTTCG GCTCGGCGAT CAGCGCCAGT
CCAGTCCAAG CGCGTGACGC ACTGTTCGAT ACGGGACGGT CGATCGTCGA TTCCGATGTC
GCCGAGCCCG GCGAGAACAA CACCAAGATC GGTCACGGAT ACACCGACGT CGCGGCCGCT
CAGGAGTGGT CGACGTCGAT TCATGCGACC GGCGACGTGA TCTCGCCCGG GGAACGCGCG
ACGATAACAG CCGCGGGAAG CGACATCGAG AACATTACCG TAGCGGACCT CTGGACGGAC
TGGTCGGTCG ACTCGACGCA ACCCGACGGC GGGACCTTCA GTGACGACGT TGCGTCCGCC
GGGACGGGGT CATTCTCGTG GGATTCGACG CAGTCGTCGG TCTCCGTGTC GCTGACTGTC
GACGTGCCGA GTCGCTACGT CGGCGGTACG TATGTGGTGG ACGTGATCGG TCAGAAATCC
GGATCGCCCG TCGAGAAGAC GGTCCAGATC GATATCTCCT GA
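
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction such as the one reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first sequence line is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used only as sample input.
first_line = "ATGAGTTCTT CTCCGTCAAA ACTGGCGGGC CCAGTGCGAC GGAAGGTGGG GGAAGGGGAG"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full multi-line sequence to `clean_sequence` works the same way, since `str.split()` with no arguments splits on any whitespace, including newlines.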
 
Protein sequence
MSSSPSKLAG PVRRKVGEGE RKQTADSDKV LPLYVRAEPG KVSAAKRAIE KTGREIRSVD 
AGYIAVDLPP KSTLTIAESD AVRHIQERHT PRAHQVPERN ISEGVGVMHA DTLHDEGVTG
DGARIAVIDH RFHTDNPKYS DRIVATVGDS TYFTSDSEYN DTSYEGPTEQ HGTACAELVA
DVAPDAELVL ATTIGPQSFG QIMNEIESYD PDGATMSLGY YTGLRIDGED PISSRIDQFT
DGGRLFANSA GNEANAHWDG QFENDGNDLM VFDSSLSTPT RFPVEMPYSG SEIHVHWDAD
WSQDDQRYKV RVYDNEDDPV GDSSALLTEQ TTDPVEIISA PSSGSNPYHL EIEKVDATGD
EHFDMFTWYS SLGRTTARRS IGIPATSPDE NLLSVAAVQA TEYGRTSEEH LKPYSSQGPT
QDGRRGIDIA APSMVSTTDE GEYGAYGALE DGGGFNGTSA ASPHVGGAFG LLFGSAISAS
PVQARDALFD TGRSIVDSDV AEPGENNTKI GHGYTDVAAA QEWSTSIHAT GDVISPGERA
TITAAGSDIE NITVADLWTD WSVDSTQPDG GTFSDDVASA GTGSFSWDST QSSVSVSLTV
DVPSRYVGGT YVVDVIGQKS GSPVEKTVQI DIS
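
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does and differs only in its permitted start codons; alternative start codons are not handled here, which is adequate for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, with the matching amino acids
# (standard assignments, shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGAGTTCTT CTCCGTCAAA ACTGGCGGGC"))  # MSSSPSKLAG
```

The output matches the first 10 residues of the protein sequence in this record.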