Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2390 |
Symbol | |
ID | 8384689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2437095 |
End bp | 2438666 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644973463 |
Product | Carbohydrate-binding family V/XII |
Protein accession | YP_003131289 |
Protein GI | 257053456 |
COG category | [R] General function prediction only |
COG ID | [COG3979] Uncharacterized protein contain chitin-binding domain type 3 |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01634] phage tail protein, P2 protein I family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.254143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACGAC GCACGAACGA TACCGGCGAA GTAGATGAGA AACCAAGTAG CGGTGCTGAG CAGCAAGGTT CGAACGACTC GACCGGCTCC AGGGACCCGT CTCGACGTGA CTTCCTGAAG GCCGGTGCCG CGGTCGGTGC AGGGACGTTC GCGGTCGGAC TCGGGCAGCA GGCCACGGCG ACCACGGCGA CGGACCCGTC GAATCTTGAT CTGTACCTGC TGTTCGGCCA GTCGAACATG GAGGGACAGG GACCGATCGA AGCCCAGGAC AGGGAGACCC ATCCGCGGAT CCACGTCCTC GCTGACAAGA CCTGTCCGAA CCTCGATCGC GAGTATGGCG AGTGGTATCT GGCCGAACCG CCGCTCAACC GATGCTATGG GAAGCTCGGT CCCGGCGATT ACTTCGCGAA GTCCATGATC GAGGAGATGC CGGACGACCG GTCGATCGGT CTCGTTCCCG CAGCCGTCAG CGGGGCCGAC ATTGCTCTCT TCGAAAAGGG GGCACCGATC GGTCGGAACG ACCGCGACAT CCCCTCCCAG TTCGACGGCG GCTACGAATG GATGGTCGAT CTTGCCGAAA CGGCCCAGCA AGTCGGGACG TTCAGGGGCA TTCTGTTCCA CCAGGGCGAG ACGAACACGA ACGATCAGCA GTGGACCGAT CAGGTCCAGG GTATCGTCGA GGATCTCCGC GCCGACCTCG GTATCGGCAA CGTCCCGTTC CTGGCGGGTG AGATGCTCTA TGACTCGGCT GGGGGGTGTT GTGGCTCGCA CAACACTGAA GTCAACGAAC TCCCGGACGT CATCGAGAAC GCTCACGTCG TCTCGGCTGA AGGACTTGCC GGCCAGGATT ACGCGCACTT CACGTCCGAA GCGTATCGAG AACTCGGCCG TCGCTACGCT GCGGAGATGC TGGAACACGT CGACGTCAGC GGCGGGACCG ACGACGGATC CGGCGGCAAC TCCGGTGATG ATTCGGGTGG CAACGATGGC GATGGGTCGG GCAGTGACTC CGATGATGAC TCGGACAGTG ACACTGGCGA CTCCGGCGAT GATTCGGGCA GTGATACCGG CGATAGTTCG GGCGATGACG CCGGTAGCGA CTCAGGGGGT TCCAGCGAGT ATCCCACGTG GGATTCAACT GCCGTTTATC GCACCGGCGA TCGGGTCGTC CACGACGGAC GCGTCTGGGA GGCCCAGTGG TACACCCAGG ATCAGGAACC CCGCGAGGAG GACTACTACG TCTGGCAACC TGTCGAGGAC GAAAGCGCCG GTAATTCCGG CGGTGACACC AGCGGGGAAT CGGGTGGTGA CACCGGTAAC TTGAACGCCG AGATGGATCC GAGCACGACA GCGGCCAGTG TCGGTGAGCG GGTCACGTTC CGCGTCACCG ACACGAGCGG TTCGAGCAAT TGGCTCACTT CTCTGGCGTT CGATTTCGGA GACGGGATGA CAGCCACCGG GTGGTGGGCT GCCCATTCCT TCGATTCGCC GGGCACCTAC ACCGTCACGC TCACCGCGAC CGACAACGGG GGTGCATCGA CCACTCACGA GGTGACGATC ACGGTCTCGT AA
|
Protein sequence | MTRRTNDTGE VDEKPSSGAE QQGSNDSTGS RDPSRRDFLK AGAAVGAGTF AVGLGQQATA TTATDPSNLD LYLLFGQSNM EGQGPIEAQD RETHPRIHVL ADKTCPNLDR EYGEWYLAEP PLNRCYGKLG PGDYFAKSMI EEMPDDRSIG LVPAAVSGAD IALFEKGAPI GRNDRDIPSQ FDGGYEWMVD LAETAQQVGT FRGILFHQGE TNTNDQQWTD QVQGIVEDLR ADLGIGNVPF LAGEMLYDSA GGCCGSHNTE VNELPDVIEN AHVVSAEGLA GQDYAHFTSE AYRELGRRYA AEMLEHVDVS GGTDDGSGGN SGDDSGGNDG DGSGSDSDDD SDSDTGDSGD DSGSDTGDSS GDDAGSDSGG SSEYPTWDST AVYRTGDRVV HDGRVWEAQW YTQDQEPREE DYYVWQPVED ESAGNSGGDT SGESGGDTGN LNAEMDPSTT AASVGERVTF RVTDTSGSSN WLTSLAFDFG DGMTATGWWA AHSFDSPGTY TVTLTATDNG GASTTHEVTI TVS
|
| |