Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0155 |
Symbol | |
ID | 8382417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 150803 |
End bp | 153070 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644971213 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003129076 |
Protein GI | 257051243 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.167458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATCG AGGAGAAAGT CGCTCAGCTG GGTTCGGCCA ACCCGTCGCA TTTCATCGAG GACGAAACGC TCGAGAGAGA CGGCGTCGAA ACACAGCTGA GCGACGGGAT CGGTCATCTC ACCCGCACTG CCGGCGAGGG CGATCTCGAT CCGAAACGGG CCGCGCTACT CACGAACCAA CTCCAGGAGT ACCTCATCGA GGAAACGCGG CTCGGTATTC CGGCGATTCC ACACGAGGAA TGCCTGAGCG GCTACATGGG TCCCCAGGGA ACTGTCTTCC CGCAGATGAT CGGCATAGCG AGCACGTGGT CGCCGCAACT GCTCGAATCG GTCACCGGCG TGGTCCGAAC CCAGCTGCAG GCGACCGGTG CGGTCCACGG GCTTTCGCCA GTGCTCGACG TTGCCCGCGA CCTCCGCTGG GGCCGTGTCG AGGAGACGTT CGGCGAAGAC CCCCAACTCG TGGCAGCGAT GGCGCGTGCC TACGTTTCGG GATTACAGGC CACGGACGCC GTTCGGGGAG ACGATCGTGA CGTCCACGCC ACACTCAAGC ACTTTGCTGG CCACGCAATG GGGGAAGGCG GCAAGAACCG CTCGTCGGTC CAGATCGGTG AGCGCGAACT CCGGGAGGTC CACCTCTACC CCTACGAGGC GGTGATCCGG AACGGCGACG CCGCCTCGAT CATGAACGCG TATCACGACA TCGACGGCGT CCCCTGTGCC AGTTCCGAGT GGCTCCTGAC GGACGTCCTC CGTGGAGAGT GGGGCTTTGA CGGGACAGTT ATTTCGGACT ACGGGAGCGT TGCTCTCCTT GACGGCGAAC ATGGCGTCGC CGCCAACAAA CGCGAGGCCG GCGTGGCTGC ACTGGAGGCG GGTCTCGATG TCGAGTTGCC AAATACCGAC TGCTACGGTG ACCCACTGCT CGAAGCGTTC GAGGCGGGAG CGGTCAGCGA AGCGACGATC GACACCGCCG TCGGTCGCGT CCTCCGGGCG AAGATCGAAA CCGGGGTGCT CGATGATCCG TTCGTCGATC CCGAAGCTGT CCCGCAAGCC TTCAACACCG ACGAACAAGC CAGACTCGCC AGGACGGCGG CCCGTGAGTC GATCACGCTG CTGGAAAACG ATGGGCTGCT TCCGTTGGGA GACGACCTCG ACACGGTCGC CCTTCTCGGC CCGAAGGCCG ACGACGATCA GGAACTCCTG GGTGATTACG CATACCCCGC GCACTTCAAT CAGGAAGAAA CCGACTTCGA GGCGACGACG CCACGTGACG CCCTCGAAAC GCGCGGTGCA GACGCTGGGT TCGCCGTCGA ATACGTCGAA GGCTGTACCA CGTCCGGCCC CTCGACCGAG CAGTTCGACG CGGCTGCCGA GGCAGCCACG GATGCGGACG TGGCGATCGC CTGCGTCGGC GCACGCTCGG CCGTCGATCT CTCCGACGAC GATCTCGCGC ACCGCAACCA GTCGATGATC CCGACCAGCG GTGAGGGGAG CGACGTCACC GATCTCGGGC TGCCGGGCGT GCAGGCGGAC CTGCTCGACC GCCTCACCAA GACGGAGACG CCGGTGATCG TCGTGCTGGT CAGCGGCAAA CCCCACGCGA TCCCGGAGAT CGCCGAGACA GTCCCATCCC TGCTTCACGC GTGGCTACCC GGCGAGGAGG GCGGCAACGG AATCGTGGAC GTACTGTTCG GCGACCACAA TCCGAGCGGA CACCTGCCAC TTTCGATCCC TAAATCCGTC GGGCAGCAGC CAGTCTACTA CAGCCGGAAA CCCAATTCCG CGAACGAGGA GCACGTCTAC GACGATGGCG AGCCGCTGTA TCCGTTCGGC TACGGGCTGA GTTACACCGA CTTCGAGTAC GGCGAACTCG AACTCGACGC CGAGACGGTC GCGCCGATGG GCACGCTAAC CGCGAGCGTC ACAGTCACGA ATGCAGGTGA CGTCGCCGGT GACGACGTGG TCCAGCTCTA CCAGCACGCC GAGAACCCGA GCCAGGCCCG TCCCGTCCAG GAACTGCTCG GCTTCGAGCG CGTGCATCTC GAACCCGGCG AGTCAAAGCG CGTCAGCTTC GAAATCGACA TGACTCGACT TGCCTACCAC GATTTGGCCA TGAACCTCGT CGTCGAAGAA GGGTCATACG AACTCCGCGT CGGCACATCC GCGGCCGATA TCGTCGACAC TGCTGCGTTC GACGTGACGG ACACGAAGGC CGTCCCGGGC TCCGCTCGCT CGTACCTCAC CCAAACCAGT ATCGAGCCGA TCAGCTAG
|
Protein sequence | MTIEEKVAQL GSANPSHFIE DETLERDGVE TQLSDGIGHL TRTAGEGDLD PKRAALLTNQ LQEYLIEETR LGIPAIPHEE CLSGYMGPQG TVFPQMIGIA STWSPQLLES VTGVVRTQLQ ATGAVHGLSP VLDVARDLRW GRVEETFGED PQLVAAMARA YVSGLQATDA VRGDDRDVHA TLKHFAGHAM GEGGKNRSSV QIGERELREV HLYPYEAVIR NGDAASIMNA YHDIDGVPCA SSEWLLTDVL RGEWGFDGTV ISDYGSVALL DGEHGVAANK REAGVAALEA GLDVELPNTD CYGDPLLEAF EAGAVSEATI DTAVGRVLRA KIETGVLDDP FVDPEAVPQA FNTDEQARLA RTAARESITL LENDGLLPLG DDLDTVALLG PKADDDQELL GDYAYPAHFN QEETDFEATT PRDALETRGA DAGFAVEYVE GCTTSGPSTE QFDAAAEAAT DADVAIACVG ARSAVDLSDD DLAHRNQSMI PTSGEGSDVT DLGLPGVQAD LLDRLTKTET PVIVVLVSGK PHAIPEIAET VPSLLHAWLP GEEGGNGIVD VLFGDHNPSG HLPLSIPKSV GQQPVYYSRK PNSANEEHVY DDGEPLYPFG YGLSYTDFEY GELELDAETV APMGTLTASV TVTNAGDVAG DDVVQLYQHA ENPSQARPVQ ELLGFERVHL EPGESKRVSF EIDMTRLAYH DLAMNLVVEE GSYELRVGTS AADIVDTAAF DVTDTKAVPG SARSYLTQTS IEPIS
|
| |