Gene Huta_2535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2535 
Symbol 
ID8384840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2599503 
End bp2601749 
Gene Length2247 bp 
Protein Length748 aa 
Translation table11 
GC content65% 
IMG OID644973612 
Producttransglutaminase domain protein 
Protein accessionYP_003131432 
Protein GI257053599 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATCA CTGACCGCAA CCCGGACATC GGCTTGTTCC GGGTGCTCGC GCTCGGTGGG 
GCACTCCTGG TGATCGGCTC GTTCGTCGGC GTCGTCCACG ACATCGTCGA CGTGATCAGC
GATCCGACTG CGCTACAGTT CGTGGTGGTA CTGACGTTCG TCGGTTCGAC GATCGCCGCA
CGATATCTCA CCGTTCGGAA CGCGATCATC GTCGCATTCG TGTTGCTGTC TGGCGGTCTC
GGGTGGTACC TCATGGGACT CTCCGGGGCG TCGCTGTGGC CATGGCCACA CATCAGGTAC
ACGCTCGCGC TCATGACCGG CAACTCGATC CTCGGGATCG TCAATCTGGA AGCGTGGGTG
ATGGCCGTGA CGCCAGCCCC GATATTTCTG ACGTGGTATC TCGCCGTCCG GCGCTGTCAC
GTCGCGGGCG CCATCGTCGG CGGGGCAACG ATGTTGTTTT TCGTCCTCAC GGGCGACGCG
GGGGCAGATC TGACACTGCT TGGCGTCGTG GGCGTCGTCT CGCTCGTTGG CGCGGGCGAA
CTTGATCGGC TGGATGCCCC TATCAGCGAA GCGGACGTCG TCGCGACAGT CGTCGCCGTC
GCCATCGTCG TCTCCGCGAC GATCACCATC GTTCCCGCCG GTGCCGTCTT CGCGTTCTCG
CCGGACAGCG GTCTGAGCGG TTCGGCCGCG GGCGGTTCCG GCGGCGGCAG TTCCGCCGAC
ACCCTCGAAG GAAGTCTCAT CAGTACGTCC GAGGAGCTGT CGATCCAGGG GAGCCTGGAC
CTGACGTCCA AACTCAGGTA CGCCGTCAGA AGCGACGAGG AAACGTACTG GCGAGTCGGT
GCCTACGACC TCTACACCGG TGACGGCTGG GTCAGGCGTG GTGAAACCGG TCCGGTCGAC
CGACGGCTCG GCTCGCCGCC GGGACGGTCC CGGACTGTCG AGCAAACCTA CCGGGCGATC
ACCAAGATCG GGACGATGCC GGCCGTCTGG CGGCCGAGCG ATATCGCGGG ACCGGCAGCC
GATCTCGCCC GTGGCACCCG GCTGGGTGGC CTGCGACCCA GTCAGATGCT CGCCGCCAAC
GACACGTATC GAGTGACCAG CGAAATGCCG ATTGCTGCGG GGGCGGACCT CCGGGAAGCG
GGGACAGCGT ATCCCGATTC CATCGCGAAC CAGTACCTCC AGCTACCCGA CAGTACGCCG
GATCGAATCG GTGAGCGAAC ACAACGCCTG ACATCGAACG CCGACAATCC CTACGATACG
GCCCGTGTCA TCGAACGGTG GCTGGAAACC AACCGTGAGT ACTCACTCGA CGTCTCGAAG
CCGTCCGGGT CGATCGCCGA CTCGTTCCTC TTCGAGATGG AACGGGGGTA CTGTACGTAT
TACGCGACGA CGATGACCAC CATGCTCAGA ACGCAGGGCA TTCCCGCACG GTTCGTGGTC
GGGTACACGT CGGGCCAGCG CGTGGCCGAA GACGAATGGG TCGTCCGCGG CCACAATTCC
CACGCCTGGG TCGAAGTGTA CTTCCCCGAG GTCGGGTGGA TCCGGTTCGA TCCGACACCG
GCCGGCCCGC GGGAATCGAC TGCCCAGCAG GATCTCGACG CAGCTCGTGA AGCCAACGAG
TCTAACGTCG ACACCAACCG AAGCCAGGAC GGCGAGTGGA CGCCCACACC GACCGAAACA
GAAACCCAGA GCGGGGACAG TCAACAACAA ACCTTCGACG AGAGCGACAT CTCGATCCCC
GAATACTACG CTCCGGACGA GGTAGAACTC AACGGAAGCA ACGTCTCCAG TGGCACAGTC
ACCGAGTTCG CTCGTCCCGG GAACGCACCG GGCTTTGGCA CTGCCAACGA CAGCCAGGAT
CGCGAAGACG ACTCGGGGCT GCCGATTCCA AGTGTCGAAC AGTTCACACT CGGCGCACTC
GCGATGCTCG GGTTCGCCGG CGTCGCCCGA CGAACCGGGC TGACCCGCCG AACCTATCGG
GCAGTCTGGC TCCGATGGCA ACCCCGCGAG GACCCCGCGA CGGACATCGA ACGCGCCTTC
GAGCGGCTGG AATGGCTCCT CGAACGGCGT CACCGCAACC GCCAGCCCGG CGAAACCGTT
CGGGACTACT TCGAGGCGGT AGACGCCGAC GAGCGCGCCT GGCGCGTCGC AACGATCCGG
GAACGCTCGC GGTACGCCGG CACCGTCGAC CGCGAGGCGG CCGACGAGGC GATCGAACTG
GTCGACGAGC TCGTCGGCGA ATCGTGA
 
Protein sequence
MNITDRNPDI GLFRVLALGG ALLVIGSFVG VVHDIVDVIS DPTALQFVVV LTFVGSTIAA 
RYLTVRNAII VAFVLLSGGL GWYLMGLSGA SLWPWPHIRY TLALMTGNSI LGIVNLEAWV
MAVTPAPIFL TWYLAVRRCH VAGAIVGGAT MLFFVLTGDA GADLTLLGVV GVVSLVGAGE
LDRLDAPISE ADVVATVVAV AIVVSATITI VPAGAVFAFS PDSGLSGSAA GGSGGGSSAD
TLEGSLISTS EELSIQGSLD LTSKLRYAVR SDEETYWRVG AYDLYTGDGW VRRGETGPVD
RRLGSPPGRS RTVEQTYRAI TKIGTMPAVW RPSDIAGPAA DLARGTRLGG LRPSQMLAAN
DTYRVTSEMP IAAGADLREA GTAYPDSIAN QYLQLPDSTP DRIGERTQRL TSNADNPYDT
ARVIERWLET NREYSLDVSK PSGSIADSFL FEMERGYCTY YATTMTTMLR TQGIPARFVV
GYTSGQRVAE DEWVVRGHNS HAWVEVYFPE VGWIRFDPTP AGPRESTAQQ DLDAAREANE
SNVDTNRSQD GEWTPTPTET ETQSGDSQQQ TFDESDISIP EYYAPDEVEL NGSNVSSGTV
TEFARPGNAP GFGTANDSQD REDDSGLPIP SVEQFTLGAL AMLGFAGVAR RTGLTRRTYR
AVWLRWQPRE DPATDIERAF ERLEWLLERR HRNRQPGETV RDYFEAVDAD ERAWRVATIR
ERSRYAGTVD REAADEAIEL VDELVGES