Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2111 |
Symbol | |
ID | 8384405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2146574 |
End bp | 2148937 |
Gene Length | 2364 bp |
Protein Length | 787 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644973180 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003131011 |
Protein GI | 257053178 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGCCT TCACGGGACC GACCGAGCAG CATCGACGGA CCCGCCGCCT CGAAACGGGC TGGCGATTTC ACCATGGGGA CGCCGAGGGG GCAGCCGACC CGGACTTCGC GGACGACGAC TGGCGTCCCG TCGAGGCCCC TCACGACTGG AGCATCGAGG GACCGTTCGA TCCGGAGAGT CCGGCCGGCA GCGCGCAGGC GTTCCTGCCC GGTGGCGTCG GGTGGTACCG CCGCGAGTTG CCGAGCGACG TCGAAGGCGA GACCGTCTAC CTTCGCTTTG ACGGCGTCTA TCGCGACAGC GACGTCTATC TCAAGGGCGA GCACGTCGGC AACCGGCCGA ACGGATACAC GAGTTTCAAT CACGACGTCA GCGGGGTCGA GGGTAGCGAG ACACTCGCCG TCCGGGTCGA CAATACCGAC CTGCCGAACT GCCGGTGGTA CTCCGGGTCG GGTATCTATC GACACGTCCA CCTGATCGAG ACATCTGCGC TGCACGTGGT CCCGTGGGGC ACAGACGTCC GCACGCCGGC CGTGACGGAG CGTCGTGCCA GGGTAGACGT CCACACCGAG GTGGCCAACG ACGCCGAGGA GGCGGCCGTC TGTACGCTCT CGACGACGGT CTACGACCCG GCGGGCGAGG TCGTCGCCGA GGCCGCGACC GAGCAGCGAT TCGCCGCCGG ACAGCAACAC ACGTTCGAGC AGGAACTCGC CGTCGCGGAG CCCGCGTTGT GGTCGCCGGA GACGCCCGAA CGGTATCACG TCCGAAGCGT CGTCCACCGC CAGGACCCGG ACGGGACGGC GGAGACGACG GGCGAACCGG TCGACGACTA CGTGACGCAG TTCGGCATCC GGACGGTCAC GTTCTCGGCC GACGAGGGGG TGTTGCTCAA CGGCGAGTCG ATGAACCTCA AGGGAGTCAA CCTCCATCAC AGCGCCGGTG CGCTCGGGGC CGCCGTGCCC GAGCGCGCGC TCGAACGCCG ACTCGAAACT CTGGCAGCGA TGGGGGCGAA CGCCATCCGG ACCGCCCACA ATCCGCCCCA GCCCGAGCTA CTGGAACTCT GTGACCGGAT GGGCTTTCTA GTCATCGACG AAGCGTTCGA CAAGTGGCGT CACGAGAAGA CCGGGAAATT CTTCGAGGAG TGGTGGCGCG AGGACCTCGC AGCGATGATC CGCCGGGACC GCAACCACCC GTCGGTGATC GCCTGGAGCG TCGGCAACGA AAGCTACGAC CACGGCGAGG CGGAGATGCT CGACGATCTG GAAATGCTGG TCGAGGCAGC CAACGACCTC GATCCGACGC GGCCGGCGAC CTACGGCAGC CCGGCCTGGG GCGACGGCCA CGAGGGGATC CTCAAGAACG CCGAGGCGGT CGCCGAGCGG GTCGATCTCT TCTCGGGCAA CTACATGGAA CACCACTACG ACGACCTCCG CGAGCGGGGC GTCGACGTGC CGATCGTCGG TTCGGAGTGC CGGCCGTTCT TCCGTGGGTC GGGCGACGAT CCGCTGGCGT TCGTCCCGCC GAACCCGTGG TTCGACGTCG CCGAGCGCGA CGACGTTGTG GGGCAGTTCA TCTGGAGCGG CTTCGACTAT CTCGGAGAGG CGCGCGAGTG GCCGAGCAAG GGCTGGCCGA CCGGCTTGAT CGACACCTGC GGCGTGCCCA AGCCCCCGGC CGCCTTCCAT CGGAGCGTCT GGAGCGACGA ACCGATGGTC GAGATCGCGG CGTTCGATCC TGCTCGCGAG CGCGCGCCGG CCCGACCGGC GTGGTCCTGG CCGGCGCTCG CCGGTCACTG GACGTTCCCC GACCGCGAGG ACTCCCGCGG GTTCGTCCAC GTCGTGACGT TCACCAACGC CGAGACGGTG ACGCTCTATC AGAACGACGA GCGCCTCGGC GTCCAGCACC TCGCGGACAA CCCCGACCAC ATGATCGAGT GGTACGTCCC CTACGAGTCG GGGACGTTGC GGGCCGTGGC GGAGACCGAC GGCGAGGTCG TCGCGACTCA CGAACTCCAG ACGGCGGGCG ATCCGGCCCG GGTCGAACTC GATCCCGACC GCGAGGCCAT CACGGCCGAC GGGCAGGATC TGGTGTACGC CGACGCGCGG ATCGTCGACG ACGACGGCGT CGTCGTGCCG CGGGCCGACC ACGAGATCGA ATGCTCCGTC AGCGGCGCTG GCGACCTCGC CGGCGTCGAC AACGGCGACC TGGCGAGCAA CGAGTCCTAC ACCGACTCCC GGCGGTCGGC CTACCACGGA ACGGCGCTGG CGATCGTCCA GGCTGATCGT TCGCCCGGCG AGGTGACGAT CACGGCCGAC GTCGAGGGCC TCGAGGGCGA CGAGGTGACG ATCCCGGTTC AGGCCCCGGA GTGA
|
Protein sequence | MTAFTGPTEQ HRRTRRLETG WRFHHGDAEG AADPDFADDD WRPVEAPHDW SIEGPFDPES PAGSAQAFLP GGVGWYRREL PSDVEGETVY LRFDGVYRDS DVYLKGEHVG NRPNGYTSFN HDVSGVEGSE TLAVRVDNTD LPNCRWYSGS GIYRHVHLIE TSALHVVPWG TDVRTPAVTE RRARVDVHTE VANDAEEAAV CTLSTTVYDP AGEVVAEAAT EQRFAAGQQH TFEQELAVAE PALWSPETPE RYHVRSVVHR QDPDGTAETT GEPVDDYVTQ FGIRTVTFSA DEGVLLNGES MNLKGVNLHH SAGALGAAVP ERALERRLET LAAMGANAIR TAHNPPQPEL LELCDRMGFL VIDEAFDKWR HEKTGKFFEE WWREDLAAMI RRDRNHPSVI AWSVGNESYD HGEAEMLDDL EMLVEAANDL DPTRPATYGS PAWGDGHEGI LKNAEAVAER VDLFSGNYME HHYDDLRERG VDVPIVGSEC RPFFRGSGDD PLAFVPPNPW FDVAERDDVV GQFIWSGFDY LGEAREWPSK GWPTGLIDTC GVPKPPAAFH RSVWSDEPMV EIAAFDPARE RAPARPAWSW PALAGHWTFP DREDSRGFVH VVTFTNAETV TLYQNDERLG VQHLADNPDH MIEWYVPYES GTLRAVAETD GEVVATHELQ TAGDPARVEL DPDREAITAD GQDLVYADAR IVDDDGVVVP RADHEIECSV SGAGDLAGVD NGDLASNESY TDSRRSAYHG TALAIVQADR SPGEVTITAD VEGLEGDEVT IPVQAPE
|
| |