Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0863 |
Symbol | |
ID | 8383136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 829893 |
End bp | 831350 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644971927 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_003129779 |
Protein GI | 257051946 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.686198 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGAAT CATCATACGG CGTACAAGCC GAGGGTATCG ACACCGACGA CGTGACGATC GCCTACATCG GCGGCGGGAG CCGCGAGTGG GCACCGAAGT TCTTCCGGGA CCTTGCGATC TCTGACCTCT CGGGAGAAGT GCGGCTACAC GACATCGACC ACGAGAGCGC CGAGCGCAAC GCCGAGTTCG GCAACTGGGT CCAGGACCGC GATGAAGTCG AAGCCGAATG GGAGTACGAA GCCGTCGCGG ATCGCGACGA AGCGCTCGAC GGCGCTGACG CTGTCGTCCT TTCGACGCAG TACAACCCCG CGGAGACCTT CGTCCACGAT CTGGACATCC CCAAGGAGCA CGGTATCTAC GGCGCGGTCG CCGCCACCAT CGGCCCGGGC GGGATCTTCC GGGCGATGCG GACGATCCCC GTCTACCGGG AGTTCGCCGC CTCCATCCGC GAGCAGTGTC CCGACGCCTG GGTGTTCAAC TTCACCAACC CCGTCCACTT CGTCACCCGC GCGCTGTACG ACGAGTATCC CGACATCAAC GCTGTCGGCT TCTGTCACGA GGTGCTGTGG ACGCGCCATC ATCTGGCGAA GATCGTCGAG GAAGAACTCG GCGAGGAGGC CGCGCGGTCG GACATCTCGG TCAACGTCAA GGGCATCAAC CACTTCACGT GGATCGACGA AGCCCGGTAC AAGGGCCGGG ACCTCTGGCC CCTGCTCGAA GACCTGGTCG ACACCGACCG CGCGAACCGC GAGTTCACGC CCGAGGACCT CGAAGACGAT TCGCCGTTCA CCGACAAACA GCAGGTCACC TGGGAGCTGT TCCGCCGCTT CGGCGTCTTC CCGGCGGCGG GCGACCGCCA CCTCGTCGAG TACGCGACCT CCTTCCTCGT CGGCGGCAAG GAGGGGCTCA ACCGCTGGGG CGTCAAACGG ACCACCAGCG ACTATCGCGC GAAACACTGG AACCCCGCCG AGTCCGAACA GACCACCGAC GTCGAGGCCT GGATGAACGG CGAGCGGGAG TTCGAACTCT TCCATTCGAA CGAAATCTTC GACGACATGA TGATGGCGCT GGCCGGGGAA GACACGATGG TCGCGAACGT CAACATGCCC AACGAGGGGC AGGTCACTGA CATCGAAGAC GGTGCCGTCG TCGAAACCAA CGCCGTGATC CGAGAGGGCG AGATCAAGCC GACCACCGCC GGTGGGTTCC CCCGTCCGGT CCGGTCGATG ATCAACGGCC ACGTCGACAC CATCGAATCG ATCATCGAGG CCTCTCGCAC CGGCGACATC GACGAAGCCT TCGCCGGCTT CCTGCTCGAC CAGCAGGTCC GGACGCTCCA GACCGAGGAG GCCCGCGAGA TGTTCGCCGA GCTGGTCGCT GCCGAGGAAG AGTATCTCCA GGGCTGGGAT CTCGACGGCT CGGACGTGCT GGCGGAAGCC GACGCCTACG ACGCCTAA
|
Protein sequence | MTESSYGVQA EGIDTDDVTI AYIGGGSREW APKFFRDLAI SDLSGEVRLH DIDHESAERN AEFGNWVQDR DEVEAEWEYE AVADRDEALD GADAVVLSTQ YNPAETFVHD LDIPKEHGIY GAVAATIGPG GIFRAMRTIP VYREFAASIR EQCPDAWVFN FTNPVHFVTR ALYDEYPDIN AVGFCHEVLW TRHHLAKIVE EELGEEAARS DISVNVKGIN HFTWIDEARY KGRDLWPLLE DLVDTDRANR EFTPEDLEDD SPFTDKQQVT WELFRRFGVF PAAGDRHLVE YATSFLVGGK EGLNRWGVKR TTSDYRAKHW NPAESEQTTD VEAWMNGERE FELFHSNEIF DDMMMALAGE DTMVANVNMP NEGQVTDIED GAVVETNAVI REGEIKPTTA GGFPRPVRSM INGHVDTIES IIEASRTGDI DEAFAGFLLD QQVRTLQTEE AREMFAELVA AEEEYLQGWD LDGSDVLAEA DAYDA
|
| |