Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2399 |
Symbol | |
ID | 8384698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2459785 |
End bp | 2462739 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644973472 |
Product | glycoside hydrolase family 9 |
Protein accession | YP_003131298 |
Protein GI | 257053465 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.466816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGAAC ACGACGCGAA ATCAGGGGGA ATGGGGCGGA CGACGACCGA CGGCGACACC GATCTGTTCC GTCGGGACCT GCTTGCGGCG ATGGGACTGG GCGCGGGATC GGTCGCGCTC GGGACGGACG TGGCCACGCC GAGCGTGGTC TCCCGAGCAG CCGCACAGAC TGACCTGGGA TTCGATTACG CCCATGCGCT CCAGCAGTCA CTGTACTTCT ACGACGCCAA TCGCTGTGGC GCAACAACGA TGGGCAATCG CCTCCAGTGG CGCGGCGAGT GCCACCACTC GGACACTGAG ATCCCGCTCG ATGCGGCGAC CGAGGACGGC GGGACCAACC TCTCTGGGAG CTTCATTGAG GAGTACAGCG ACGTGCTCGA CCCCGACGGC ACCGGGACGA TCGACGTCAG CGGCGGGTTT CACGACGCCG GCGACCACAT GAAGTTCGGG CTCCCCCAGT CCTACAGCGC CTCGACGCTT TCCTGGGCGC TCTACGAGTT CGAGGACGCC TTCAGGGATG TCGGCTCCTA CGACCACATG GTCGACATCC TGCGGCACTT CGCCGATTAC TTCCTGAAGT CGACGTTCCG GGATGACGAA GGCAACGTCG TCGCCTTCTG CTATCACGTC GGCGAGGGGA GCATCGACCA CAACTACTGG GGGCCGCCGG AACTCCAGTC GTCCGAGGAG TATCCCCGGC CAGCCTACTT TGCTACGCCG GAGGATCCAG CCAGCGACCA GTGCGCCGGG ACGGCCGCGG CGCTGACGAT CACCTCGCTC GTCCTGGAGT CCGAGGATTC GGCGTACGCG GCGGAGTGTC TCGACACGGC ACAGGCGCTG TATGACTTCG CCGTCGAGAA CCGCGGACTT GGGTACGACG GGGGTTTCTA CGACTCGAGT TACGACGAGG ACGAACTCTC CTGGGCGGCG GTCTGGTTGC ACATCGCGAC TGAGGACGAC GCGTATCTGG ATGACATCCT CGCGACCGAC GATTCGGGCA CCTACACCGG GTATCTCGGG GAGATCATCG ACTCGACCGA CGACGACTGG CAGAACATCT GGGTCCACTC CTGGGACACG GTCTGGGGCG GTGTCTTCCT CAAACTCGCG CCGATCACCG ACGACCCCGA GCACTGGCAG ATCGCCCGCT GGAACCTGGA GTATCTCTCT GGCGGTTCGG TCGAGCATGA AGACGACAAC GATACGAACT ACGCCTCGAC GTCGGATGCT GGGTTCACCG TGCTCAACAC GTGGGGGTCG GCCCGGTACA ACGCCGCCGC GCAGTTCCAG GCGATGGTCT ACCGGAAGTA CCGCGACACC GAGAAGGCGG TCGCACTCAC CGACTGGGCG GCGACCCAGA TGAACTACAT CATGGGTGAC AACTCCTTCG GGTACTCGCT GATCGTCGGG TTCACCGACG ACCACGCCGA GCATCCCCAC CACCGGGCTG CCCACGGCTC GAAGGAGAAC AGCATGGAGG AACCCGAAGA GCACCGCCAC ACGCTGTGGG GTGCCCTGGT CGGCGGGCCC GACGAGGACG ACACCCACGT CGACGAGACC TCGGATTACG TCTACAACGA GGTCGCGATC GACTTCAACG CGGGACTGGT CGGCGCGCTC GCGGGGTTCA ACACCTTCTA CGACGATACC GGCGAGGCAG TCGCGGAGTT TCCCCCGGGT GAGGAGCCGA TCGACGCCTA CTACGCCGAG GGGGAGGTCC TCCAGGAGAA CGCCGACCGG ACACAGGTTC GAGTGACGAT CCACAACGAA TCGATCCACC CGCCCCATCG CGAGGACGGC CTCAGCGCCC GCTATTTCAT CGACGTCAGC GAGTTGCGCG ACGCCGGCCA GTCGATCGAC GCCGTCTCGG TCGAGGTCCA GTACGACCAG CAATCAACCA TGGGCGATGG GTCGGCCGAC GTCTCGGGGC CGATCGCCTG GGACGAGGAT GCCGGCATCT ACTACATCGA ACTCGACTGG TCGGGCAACC AGATCTATGG CGCACGGGAG ATTCAGATCT CGATGATCGC CGAGCAGGAC GACAACTGGG AGAGCAACTG GGATCCATCG AACGACCCGA GCTTCCAGGA CATCGGCGAG GCGGCGACCG TCACCGAGGC GATCTCCGTC TACCTCGACG GCGAACTGGT TTACGGCCAG CTTCCGGGTG AGTCCGAGTC TGAGCCCGAC GACACGACCG CTCCGACGGC TCCCTCGAAC CTCTCTGTGG TCGAGACGAC AGCGTCCTCG GCCGAGGTCG AGTGGGAGGC CGCCAGCGAC GAGGGCGGTA GCGGCCTCGA TCACTACACC ATCTCCGTTG CGGGCGACTT CGACCAGCAG GTTGGGGCAG GGACGACGAC GGCGACTGTC GAGGAACTGG ACGCCGAGAC GACCTACGAG ATCGGCGTCT CGGCGGTCGA CGGTGCCGGC AACGAGTCCG ATACGGTAAC CGTCGAGGCG ACGACTGACG AGGCCGACGA CGGCGAGGAC GACAGCGATG ACGAGGAATC ACCGACGGAT GCCCTCGTCG TCAACGACTA CGACGGCGAC CCGGCGTGGT CGAGCAACCG GAACGACCTC GGCCAGTGGT GTGGGGCCGG CTCCTTCGAG AACGGGGCAG GCGAGGTAGC CGACGGGGCG CTGGTCCTCG AATACGACAA CGGTGGCTGG TACCAGGAAC AGATCAACCG GGATGTGAGC GACTATTCGA GCGTCGTCCT GGATGTGTGC GGTGCGAACG GTGGCGAGGA GAACGAAATC CGCTTTGCCA TGGGCGGCGT CTCGGGCCTT CTGGGTGATC TCACCGGCGA TTCGATCGGG ACGAGTGCCG GCGAGGTACG GATCGACATG GAATCAGCTG GCATCGACCC CACTGCTGAG GGACTCGCGG TGCGGCTAAA CTTCTGGCAG GGGGGTGAGA GCACCCTCGC AATCGAGGCG ATCCGTCTCG AGTAA
|
Protein sequence | MTEHDAKSGG MGRTTTDGDT DLFRRDLLAA MGLGAGSVAL GTDVATPSVV SRAAAQTDLG FDYAHALQQS LYFYDANRCG ATTMGNRLQW RGECHHSDTE IPLDAATEDG GTNLSGSFIE EYSDVLDPDG TGTIDVSGGF HDAGDHMKFG LPQSYSASTL SWALYEFEDA FRDVGSYDHM VDILRHFADY FLKSTFRDDE GNVVAFCYHV GEGSIDHNYW GPPELQSSEE YPRPAYFATP EDPASDQCAG TAAALTITSL VLESEDSAYA AECLDTAQAL YDFAVENRGL GYDGGFYDSS YDEDELSWAA VWLHIATEDD AYLDDILATD DSGTYTGYLG EIIDSTDDDW QNIWVHSWDT VWGGVFLKLA PITDDPEHWQ IARWNLEYLS GGSVEHEDDN DTNYASTSDA GFTVLNTWGS ARYNAAAQFQ AMVYRKYRDT EKAVALTDWA ATQMNYIMGD NSFGYSLIVG FTDDHAEHPH HRAAHGSKEN SMEEPEEHRH TLWGALVGGP DEDDTHVDET SDYVYNEVAI DFNAGLVGAL AGFNTFYDDT GEAVAEFPPG EEPIDAYYAE GEVLQENADR TQVRVTIHNE SIHPPHREDG LSARYFIDVS ELRDAGQSID AVSVEVQYDQ QSTMGDGSAD VSGPIAWDED AGIYYIELDW SGNQIYGARE IQISMIAEQD DNWESNWDPS NDPSFQDIGE AATVTEAISV YLDGELVYGQ LPGESESEPD DTTAPTAPSN LSVVETTASS AEVEWEAASD EGGSGLDHYT ISVAGDFDQQ VGAGTTTATV EELDAETTYE IGVSAVDGAG NESDTVTVEA TTDEADDGED DSDDEESPTD ALVVNDYDGD PAWSSNRNDL GQWCGAGSFE NGAGEVADGA LVLEYDNGGW YQEQINRDVS DYSSVVLDVC GANGGEENEI RFAMGGVSGL LGDLTGDSIG TSAGEVRIDM ESAGIDPTAE GLAVRLNFWQ GGESTLAIEA IRLE
|
| |