Gene Huta_1364 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1364 
Symbol 
ID8383643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1336524 
End bp1338881 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content65% 
IMG OID644972427 
Productglycosyltransferase 36 
Protein accessionYP_003130273 
Protein GI257052440 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATACG GACACGTCGA TCAGGAAACG GGCGAATACG TCATCGAACG ACCCGACACG 
CCGACGCCGT GGATCAACTA CCTCGGCGAG GGCGTCTACG GCGGCATCGT CTCGAACACC
GGTGGCGGCT ACAGCTTCTA TAAAGACCCG AAGAACCAGC GCGTGACGCG CTATCGGTAC
AACGCCGTCC CGGACGACCA GCCCGGTCGC TACGTCTACC TGCAGGATCG GGAAAGCGGC
GAGTACTGGT CGCCGACCTG GCAACCCGTG AAGACGGACC TCGACGACTA CGAGTGTCGT
CACGGGCCGG GCTACACCAC GATCGAGAGC AAATACGACG GCATTGCGGC CGAGATGACG
TATTTCGTCC CGCTCGGTGA GGACTGCGAA CTGTGGGTGC TGGACATCGA AAACGAGGGC
CGGGAGACCC GCACCCTCGG GGCGTTCTCC TACGTCGAGT TCTCGTTCCC GGACGCGCTG
GGTGACCAGA CCAATCTCGA CTGGACGGGC CAGATCATGC GCTCGCGGGT CGACGAAACC
GAGGGCGTCA TCGAGATGTA CTCTTCCGCC CAGGAACTCA GCTACACCCA CGCGACCAGT
GCCGAGGTCG TCGGCTTTGA CACGCAGCGG CGGGAATTCG TCGGCCAGTA CGGCAGCCTG
GAGGAACCAG CCGTCCTGGA AGCGGGAGCA GCCAGAAACT CACAAGCCAC ACACGACAAC
GTCATCGGTT CCTTCGAGCA CGACCTCGAA CTCGAACCGG GCGAGAGCGA GCGAATTGTC
TTCATGACCG CGCCCGAACG AAACGACGAC CTCATCGCGA AGTACGACGA TCCGTCGGTC
GTCGACGAGG CGTTCGAGGC CCTGCAAGCG GAGTGGGAGG ACTATCTCTC GACGCTGCAG
GTCCAGACCC CTGACGAGCA GATGAACACA ATGGTTAACG TCTGGAACCC GGTGCAGTGT
CGCTCGACGC TGTACTGGTC CCGGACGGCC TCCCGCTACC AGGCCGGGTT GGGCCGCGGG
ATGGGGACGC GTGACTCCAG CCAGGACACG CTCTCGATCG TCCACGCCGT GCCCGATCAA
GTCCGGGAGA CCCTGGAGAT GCTCTGGAAA CTCCAGTTCC CCGACGGCCA CGCCTGGCAC
CAGGTTTACC CACTCAGCGG CGAGGGTGAC GCCGGTCTCG CCACGGAGGA TCCCTCCAAG
CCACAGTGGT TCTCCGACGA CCATCTCTTT CTCGTGCTGG GAACCGTCCA GTATCTCAAG
GAGACCGGCG ACTACGACTT CCTGGAAGCC GATATTCCAT TCGAGGACGG CTCCACGGGC
TCCGTCCGCG AGCACATCGA GCGTGCCATC GAGTTCACCG ACGAGCACCG CGGCACACAC
GGCCTGCCGC GGATGGGCTA CGCCGACTGG AACGACTCGC TGAACCCCGA CGACGGCAGC
GGCGAGGCCG CCAGCGTCAT GGTCGCGATG ATGTACTGTC GCGTCCTCGA TGAAGTCGCC
GGCCTCTACG AGTTCCTGGG CGAGGACGAC CGGGCAGCCG AACTCCGCGC CAAACGCGAG
GCAGAGATCG AGCGGATCGA CGAGCACGCC TGGGACGGCG ACTGGTACAC CCGCGCCTAC
GACGACGAGG GGCGCGTCAT CGGATCAGCC AGTGAAGACT ACCAACAGAT CTCGCTGAAC
ACCCAGACCT GGGCCGCACT CGGCGGCGTC GACGACGACC GCGCCCGGGA AGCCATGGAA
AACGCCCACG ACCGCCTCAA CACCGAGTAC GGCTTCGCAC TGCTCGACCC GCCGTGGGAA
GGCGAGGGCA AGATCGACCG GATCGGCGGC ACGACGACCT ACCCGCCCGG GGCCAAGGAG
AACGGCGGCA TCTTCTGTCA CGCCCACACG TGGTCGGTCG TCGCCGCCGG CCTGCTGGGC
GACGGCGAGC GCGCCTACCA GTACTACCGG CAACTCCTCC CGCTGGCCCA CGACGACGTC
GCTGATCTCC GGCGCGTCGA ACCGTACGTC TACTGCCAGA ACGTCCTCGG GCCCGCACAC
GAGGAGTTCG GCGTCGCGAA GAACTCCTGG CTGACCGGGA CCGCATCATG GGCGTACGTC
GGTGCGACCC AGTACCTCCT TGGCGTTCGA CCGACCTTCG ATGGCCTCCT GGTCGACCCG
ACGATCCCCG CAGAGTGGGA CGGCTTCGAG ATGGAACGGG AGTTCCGCGG CGCGACCTAC
GAGATCGCCG TCGAGAATCC CGACGGCGTG GAGAGTGGCG TCGCGAGTGT CGAAGTCGAC
GGCGAGGTGA TCGAGGGCAA TGTGGTGCCC GCGTTCGAGG ACGGCGAGAC CCATGAAGTT
CGGGTCGTGA TGGGCTGA
 
Protein sequence
MTYGHVDQET GEYVIERPDT PTPWINYLGE GVYGGIVSNT GGGYSFYKDP KNQRVTRYRY 
NAVPDDQPGR YVYLQDRESG EYWSPTWQPV KTDLDDYECR HGPGYTTIES KYDGIAAEMT
YFVPLGEDCE LWVLDIENEG RETRTLGAFS YVEFSFPDAL GDQTNLDWTG QIMRSRVDET
EGVIEMYSSA QELSYTHATS AEVVGFDTQR REFVGQYGSL EEPAVLEAGA ARNSQATHDN
VIGSFEHDLE LEPGESERIV FMTAPERNDD LIAKYDDPSV VDEAFEALQA EWEDYLSTLQ
VQTPDEQMNT MVNVWNPVQC RSTLYWSRTA SRYQAGLGRG MGTRDSSQDT LSIVHAVPDQ
VRETLEMLWK LQFPDGHAWH QVYPLSGEGD AGLATEDPSK PQWFSDDHLF LVLGTVQYLK
ETGDYDFLEA DIPFEDGSTG SVREHIERAI EFTDEHRGTH GLPRMGYADW NDSLNPDDGS
GEAASVMVAM MYCRVLDEVA GLYEFLGEDD RAAELRAKRE AEIERIDEHA WDGDWYTRAY
DDEGRVIGSA SEDYQQISLN TQTWAALGGV DDDRAREAME NAHDRLNTEY GFALLDPPWE
GEGKIDRIGG TTTYPPGAKE NGGIFCHAHT WSVVAAGLLG DGERAYQYYR QLLPLAHDDV
ADLRRVEPYV YCQNVLGPAH EEFGVAKNSW LTGTASWAYV GATQYLLGVR PTFDGLLVDP
TIPAEWDGFE MEREFRGATY EIAVENPDGV ESGVASVEVD GEVIEGNVVP AFEDGETHEV
RVVMG