Gene Information |
Locus tag | Huta_1364 |
Symbol | |
ID | 8383643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1336524 |
End bp | 1338881 |
Gene Length | 2358 bp |
Protein Length | 785 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644972427 |
Product | glycosyltransferase 36 |
Protein accession | YP_003130273 |
Protein GI | 257052440 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
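The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal consistency check, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon excluded from the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1336524
end_bp = 1338881
gene_length_bp = 2358
protein_length_aa = 785

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bp each).
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree with one another, which is consistent with the gene sequence ending in the stop codon TGA.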
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATACG GACACGTCGA TCAGGAAACG GGCGAATACG TCATCGAACG ACCCGACACG CCGACGCCGT GGATCAACTA CCTCGGCGAG GGCGTCTACG GCGGCATCGT CTCGAACACC GGTGGCGGCT ACAGCTTCTA TAAAGACCCG AAGAACCAGC GCGTGACGCG CTATCGGTAC AACGCCGTCC CGGACGACCA GCCCGGTCGC TACGTCTACC TGCAGGATCG GGAAAGCGGC GAGTACTGGT CGCCGACCTG GCAACCCGTG AAGACGGACC TCGACGACTA CGAGTGTCGT CACGGGCCGG GCTACACCAC GATCGAGAGC AAATACGACG GCATTGCGGC CGAGATGACG TATTTCGTCC CGCTCGGTGA GGACTGCGAA CTGTGGGTGC TGGACATCGA AAACGAGGGC CGGGAGACCC GCACCCTCGG GGCGTTCTCC TACGTCGAGT TCTCGTTCCC GGACGCGCTG GGTGACCAGA CCAATCTCGA CTGGACGGGC CAGATCATGC GCTCGCGGGT CGACGAAACC GAGGGCGTCA TCGAGATGTA CTCTTCCGCC CAGGAACTCA GCTACACCCA CGCGACCAGT GCCGAGGTCG TCGGCTTTGA CACGCAGCGG CGGGAATTCG TCGGCCAGTA CGGCAGCCTG GAGGAACCAG CCGTCCTGGA AGCGGGAGCA GCCAGAAACT CACAAGCCAC ACACGACAAC GTCATCGGTT CCTTCGAGCA CGACCTCGAA CTCGAACCGG GCGAGAGCGA GCGAATTGTC TTCATGACCG CGCCCGAACG AAACGACGAC CTCATCGCGA AGTACGACGA TCCGTCGGTC GTCGACGAGG CGTTCGAGGC CCTGCAAGCG GAGTGGGAGG ACTATCTCTC GACGCTGCAG GTCCAGACCC CTGACGAGCA GATGAACACA ATGGTTAACG TCTGGAACCC GGTGCAGTGT CGCTCGACGC TGTACTGGTC CCGGACGGCC TCCCGCTACC AGGCCGGGTT GGGCCGCGGG ATGGGGACGC GTGACTCCAG CCAGGACACG CTCTCGATCG TCCACGCCGT GCCCGATCAA GTCCGGGAGA CCCTGGAGAT GCTCTGGAAA CTCCAGTTCC CCGACGGCCA CGCCTGGCAC CAGGTTTACC CACTCAGCGG CGAGGGTGAC GCCGGTCTCG CCACGGAGGA TCCCTCCAAG CCACAGTGGT TCTCCGACGA CCATCTCTTT CTCGTGCTGG GAACCGTCCA GTATCTCAAG GAGACCGGCG ACTACGACTT CCTGGAAGCC GATATTCCAT TCGAGGACGG CTCCACGGGC TCCGTCCGCG AGCACATCGA GCGTGCCATC GAGTTCACCG ACGAGCACCG CGGCACACAC GGCCTGCCGC GGATGGGCTA CGCCGACTGG AACGACTCGC TGAACCCCGA CGACGGCAGC GGCGAGGCCG CCAGCGTCAT GGTCGCGATG ATGTACTGTC GCGTCCTCGA TGAAGTCGCC GGCCTCTACG AGTTCCTGGG CGAGGACGAC CGGGCAGCCG AACTCCGCGC CAAACGCGAG GCAGAGATCG AGCGGATCGA CGAGCACGCC TGGGACGGCG ACTGGTACAC CCGCGCCTAC GACGACGAGG GGCGCGTCAT CGGATCAGCC AGTGAAGACT ACCAACAGAT CTCGCTGAAC ACCCAGACCT GGGCCGCACT CGGCGGCGTC GACGACGACC GCGCCCGGGA AGCCATGGAA AACGCCCACG ACCGCCTCAA CACCGAGTAC GGCTTCGCAC TGCTCGACCC GCCGTGGGAA 
GGCGAGGGCA AGATCGACCG GATCGGCGGC ACGACGACCT ACCCGCCCGG GGCCAAGGAG AACGGCGGCA TCTTCTGTCA CGCCCACACG TGGTCGGTCG TCGCCGCCGG CCTGCTGGGC GACGGCGAGC GCGCCTACCA GTACTACCGG CAACTCCTCC CGCTGGCCCA CGACGACGTC GCTGATCTCC GGCGCGTCGA ACCGTACGTC TACTGCCAGA ACGTCCTCGG GCCCGCACAC GAGGAGTTCG GCGTCGCGAA GAACTCCTGG CTGACCGGGA CCGCATCATG GGCGTACGTC GGTGCGACCC AGTACCTCCT TGGCGTTCGA CCGACCTTCG ATGGCCTCCT GGTCGACCCG ACGATCCCCG CAGAGTGGGA CGGCTTCGAG ATGGAACGGG AGTTCCGCGG CGCGACCTAC GAGATCGCCG TCGAGAATCC CGACGGCGTG GAGAGTGGCG TCGCGAGTGT CGAAGTCGAC GGCGAGGTGA TCGAGGGCAA TGTGGTGCCC GCGTTCGAGG ACGGCGAGAC CCATGAAGTT CGGGTCGTGA TGGGCTGA
|
Protein sequence | MTYGHVDQET GEYVIERPDT PTPWINYLGE GVYGGIVSNT GGGYSFYKDP KNQRVTRYRY NAVPDDQPGR YVYLQDRESG EYWSPTWQPV KTDLDDYECR HGPGYTTIES KYDGIAAEMT YFVPLGEDCE LWVLDIENEG RETRTLGAFS YVEFSFPDAL GDQTNLDWTG QIMRSRVDET EGVIEMYSSA QELSYTHATS AEVVGFDTQR REFVGQYGSL EEPAVLEAGA ARNSQATHDN VIGSFEHDLE LEPGESERIV FMTAPERNDD LIAKYDDPSV VDEAFEALQA EWEDYLSTLQ VQTPDEQMNT MVNVWNPVQC RSTLYWSRTA SRYQAGLGRG MGTRDSSQDT LSIVHAVPDQ VRETLEMLWK LQFPDGHAWH QVYPLSGEGD AGLATEDPSK PQWFSDDHLF LVLGTVQYLK ETGDYDFLEA DIPFEDGSTG SVREHIERAI EFTDEHRGTH GLPRMGYADW NDSLNPDDGS GEAASVMVAM MYCRVLDEVA GLYEFLGEDD RAAELRAKRE AEIERIDEHA WDGDWYTRAY DDEGRVIGSA SEDYQQISLN TQTWAALGGV DDDRAREAME NAHDRLNTEY GFALLDPPWE GEGKIDRIGG TTTYPPGAKE NGGIFCHAHT WSVVAAGLLG DGERAYQYYR QLLPLAHDDV ADLRRVEPYV YCQNVLGPAH EEFGVAKNSW LTGTASWAYV GATQYLLGVR PTFDGLLVDP TIPAEWDGFE MEREFRGATY EIAVENPDGV ESGVASVEVD GEVIEGNVVP AFEDGETHEV RVVMG