Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2398 |
Symbol | |
ID | 8384697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2456668 |
End bp | 2459682 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644973471 |
Product | Carbohydrate binding family 6 |
Protein accession | YP_003131297 |
Protein GI | 257053464 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0137535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACG AAGCGACCGA ATCGATTGAA GCATCGGCGA CTGATCACAC TGACGAGACA GCTGGAAATC GCAAGGACCC CGGTCTCACC TCGTCACGCC GGACGTTCCT CGGGGCGATG GCGAGTGCTG GGACGATCGG TGCTGGGCTT TCGGCCGCCA CTGGGACCGC TGCCGCCGGT GTGCCGACGC CACGGCTGCA CACCGAGGGG CGGTGGATCC GCGATCCGGC GGGCAACGAC GTGACGCTCC GGGGGATGGC ACCCGCTGAC CCCGGTTTCT ACCGGCAGTA CCATCCCAAG AGCTTCGAGG AAGTGCTGGA GTGGGCGACT GACACGGATC GGGGCTGGCA TCCCAACATT GTCCGGCTAC CCTGTACGCA GGACTCGATC GACGCGCTGG GCCTGGAAAC GTACGTCACC GAGGTCCTCC GCCCCGCGGT CGACCTGCTG GCCGCGCGGG ATGTCTACGC GCTGGTGGAC TTCCACCTCA TCAGGCCCTA CACACAGGAT GCAACCGAGA CGTACAACGA GGAAAACGAC GACGACCTCG CGCCGATCGA CGACGTGATG ACGACCTTCT GGGATCGGGT CGCCCCGGAG TTCGCCGAGG ACGAACACGT CATCTACGAG CTGTTCAACG AGCCGACCCA GCCGGCGATG TACGGCGACG ATGCCGGTGC CTTTCAGGCC TGGCGGGACG CCGCCCAGCC GTGGGTCGAC CTCGTCCGCG AACACGCGCC GGAGACGCCG ATCATCATCG GCTCGCCGCG GTGGACGTCG GTGACCCACA TGGCGCCGGA GTATCCCTTC GATGGGGAGA ACCTGATCTA CGCGGCGCAC ATCTACCCCG ACAACGGCCC GCCCGCGGAC TTCGACCAGT GGTACGGCGA ACCCGCCACC GAAGTCCCGG TCGTCGTCAC GGAGTTCGGC TGGGAACCCA CCGGGGGCTC CGTCGATCAG GGCACCACCT CCGGGTGGGG CGAGCCGTTC CGCGAGTGGG TCGAGGGCTA CGAGAACATG GGGTGGATCT CGTGGTGTTT CGACGACTCC TGGGAGCCGG CCTTCTTCGA GTCGCCGGAC GCTGGGGCCA ACGAGCCCTG GACGCTCAAG GACGACGCAG ATCAGATGGG GGGGTACATC AAGACCTGGC TGGAGGCAAC CAAAGATCAG GGCATCCCGG AGAGTGCGAT CGACGACGAC GTCGCGCCGC CGGTTCCATC CGGCCTCGAG GTGACCCGTT CGACCGAGAT CAGCGTCGAG ATCGCCTGGA ACGCCGTCAC CGACGAGGGC GAGGCCGGCC TCTCCCATTA CAACGTCTAC GTCGACGGCG AGCGCCGCGG GCAGGTGATC GACGGGACGG CGACGACGGT CGACGGCCTG GAGCCGGCTT CGACCTACGA GGTCGGTGTT TCTGCCGTCG ACAGTGCGGG CAACGAGTCC AATCAGACGA CGACGGTCGC CGAAACGATT GCCACCGACG CCGGCCAGTC GGCGTTCGTC GAGCACGAAC TCCCGGGCCG CATCCAGGCC GAGGACTTCG ACGAGGGTGG CCAGGGAATC GCCTATTACG ATACAGGATC CACGAACGAG GCCGGGGCCG ACTACCGCGA GACGGGCGTC GACATCGGGA CGGCCGTCGA GTCGGGGTAC AACGTCGGCT ACACCGAGAC CGGCGAGTGG CTCGAGTACA CTGTCACCGT CGAATCCGGT GGTAGCTACG AGGCCACCGT TCGGGTTGCC AACGGCGCTG ATTCGGGTGG CGACCTCCGG ATCGAGGTCG ACCGCGCCGA GGTGGCGACA CAGAACGTCT GGCCGACCGG CGGCTGGGAG AACTTCGAGG AGATCCGTGT CGGCGAGGTC GACATCCCCG AGGGCGAGCA CGTCATCCGG ATCGTCGTCG AGACCAGCGG CTGGAACTTC GACTGGATCG AGTTCACTGG CGGCGACGGC GGCGGCGAGG ACGTGACCCC GCCGACTGCT CCCTCGAACC TCTCAGTGAC CACGACGACG CCGTCATCCG CCGAGATCGC GTGGGATGCC GCGACCGACG AGGGCGGGAG CGGACTCGAT CACTACGCGG TGTACGTCGA CGGGAGTCTC GATCAGCAGG TTCCGACCGG GACCACGTCG GCGACGATCG CGGATCTCGC GGCCGAGACG AGCTACGAGA TCGGCGTCTC GGCCGTCGAT GGGGCAGGCA ACGAGTCCGA ATCGGTGACT GTCGACGTGA CGACCGACGC CGGCGACGAC ACGACCCCGC CGACTGTCCC CGGCGACCTC TCGGTCGATG GGACGACGGC CACGTCGATC GACGTCGCCT GGAGTGGTGC TTCGGACGCC GGCACGGGTG TCGACGCCTA CGCCGTCTAC GTCGACGGGA GCCGTGATCA GGCGGTTAAG GCAGGGACGA CGACGGCGAC GATCGACAGC CTCTCGGCGG TGACGACCTA TGAGGTCGGG GTTTCGGCGA TCGACGGGGC CGGCAACGAG TCGGCGACGG CGACCGTCGA GGCCACCACC GACCAGAGCG ACGACGGCGA AGACGATGAG GACGACGAAT CACCGGCAGA CGCCCTGGTC GTCAACGATT ACGACGGCGA TCCGTCGTGG TCGAGCAATC GCAACGACCT CGGCAAGTGG TGCGGGGCCG GGTCGTTCCA GAATGGTACT GCCGGTGGCG GTGCGGTCGA GGACGGTGCG CTGGTCCTCG AATACGACAA CGCCGGGTGG TTCGTCGAAC AGGTCCAGCA AGACGTCAGC GACTACTCGA CGGTCGTGTT GCGGGTCAGC GGGGCGAACG GCGGCGAGGA GAGCGAGTTC CTCTTCGACA TGGGCGGTGC GCGCGACCTG CTCGCGAATC TGACCGACGA CTCGATCACG ACGAGTGTCA CTGACGTCGC GATCGACATG GAGTCGGCCG GGATCGACCC GTCGGGCGGG GGACTCTCGA TCCGCCTGAA CTTCTGGCAA GGAGGTGCGA GCACGCTCGA AATCGAAGAG ATCCGACTCG AATAG
|
Protein sequence | MTDEATESIE ASATDHTDET AGNRKDPGLT SSRRTFLGAM ASAGTIGAGL SAATGTAAAG VPTPRLHTEG RWIRDPAGND VTLRGMAPAD PGFYRQYHPK SFEEVLEWAT DTDRGWHPNI VRLPCTQDSI DALGLETYVT EVLRPAVDLL AARDVYALVD FHLIRPYTQD ATETYNEEND DDLAPIDDVM TTFWDRVAPE FAEDEHVIYE LFNEPTQPAM YGDDAGAFQA WRDAAQPWVD LVREHAPETP IIIGSPRWTS VTHMAPEYPF DGENLIYAAH IYPDNGPPAD FDQWYGEPAT EVPVVVTEFG WEPTGGSVDQ GTTSGWGEPF REWVEGYENM GWISWCFDDS WEPAFFESPD AGANEPWTLK DDADQMGGYI KTWLEATKDQ GIPESAIDDD VAPPVPSGLE VTRSTEISVE IAWNAVTDEG EAGLSHYNVY VDGERRGQVI DGTATTVDGL EPASTYEVGV SAVDSAGNES NQTTTVAETI ATDAGQSAFV EHELPGRIQA EDFDEGGQGI AYYDTGSTNE AGADYRETGV DIGTAVESGY NVGYTETGEW LEYTVTVESG GSYEATVRVA NGADSGGDLR IEVDRAEVAT QNVWPTGGWE NFEEIRVGEV DIPEGEHVIR IVVETSGWNF DWIEFTGGDG GGEDVTPPTA PSNLSVTTTT PSSAEIAWDA ATDEGGSGLD HYAVYVDGSL DQQVPTGTTS ATIADLAAET SYEIGVSAVD GAGNESESVT VDVTTDAGDD TTPPTVPGDL SVDGTTATSI DVAWSGASDA GTGVDAYAVY VDGSRDQAVK AGTTTATIDS LSAVTTYEVG VSAIDGAGNE SATATVEATT DQSDDGEDDE DDESPADALV VNDYDGDPSW SSNRNDLGKW CGAGSFQNGT AGGGAVEDGA LVLEYDNAGW FVEQVQQDVS DYSTVVLRVS GANGGEESEF LFDMGGARDL LANLTDDSIT TSVTDVAIDM ESAGIDPSGG GLSIRLNFWQ GGASTLEIEE IRLE
|
| |