Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0302 |
Symbol | |
ID | 8382566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 286222 |
End bp | 287667 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644971361 |
Product | glycoside hydrolase family 5 |
Protein accession | YP_003129222 |
Protein GI | 257051389 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0480106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGACG AAGAAAAAAC ACGTAACGCA GAACACGACG ACACGCGATC GATCGACCGA CCAACGGGCG ACAAGCACAC CCTGTCGACA CCATCACGGC GACGGTTCCT CCAGGCAACA GCGGGAACCG GAATCGCCGT CGCGAGTGGT GGCTTGGTGG GGCATGCAGC GGCCGATTCA CACACGAGTG CGCTCCCACC GCTCAGACGG AACGGTAATC AGATCGAGAC ACCCGATGGG GAGACGATCG AACTCCACGG CGTCAATATC GCCGATCCAA AGCGAGTCGA CGTGACCGCG TCGGTACGCG GGAAAGATGC CGCCCAGGCG ATCGATCTTG CGACAGACCC CAGTCAGGGC TGGCATGCCG ACGTCGTCCG GCTGCCGGTC CAGCCGGTCG ACATCGGCGA ACACCAGGCT GGAGCGGGAC CGGAGCCGCC GGCGTTCGAC GAATCTCAGC TGCAGGACTA TATCGAGAAC CATCTCGATC CGGCCGTCCA ACAGTGCGCG GCAAACGGGG CGTACGCGAT CATCGACTAC CACCGGCATC GCGACATTCC GTGGACGGAT GACGGGCTCA GCCAGGAAGT GACGATGTTC TGGGACATCG TCGCCCAGCG GTACGGTGAG ATGGATCACG TCATCTTCGA AGTATACAAC GAACCCCAGG GCAACCCGAA CTACGGCGTC TCTGGCCAGG AGTTAGTGGA CTTCTGGGGT GAGTGGAAAG CCACAGCCCA GCCGTGGGTC GACACCGTTC GGGAGTACAC AGAGAATCTC GTGCTCGTCG GGTCACCGCG GTGGTCACAG ATGACCTTCG GTGCGGTGAT CGAGGAGTTC GATGGGGCAA ATATCGGCTA TACACTCCAC CTCTATCCGG GACACGGACC GACGACACCT GGAGACTACG ATGACTTCGT CACGCCGGTC AACAACGGCG GCGGAGATGT TCCCTACGAC GACGAGACGC CCGCCTGGGA GGTCGCGCCG GTCTTCATGA CCGAGTGGGG ATTCGATTAC GACGCCGACC CGGCCGCGGG CGGCGGCGTC AACGAGGATA CGGCAGCTGG CGCGGACTGG GCGGAGCACG ATTCGGATTT CGGCCACCAC GTGACCGAGT GGCTCTCGAC GCGCCCGGTG CATTCGACCG CGTGGGTGTT CGACGTGCTC TGGGACCCCA ACATGTTCAC GCGTGGGTTC GATGTCCCCG ACGACGAAGG CTGGGCACCC TATACGGGTG GGGAGATCCC GGAATACGTC ACTGATCGGC CTGCCGAATG GGTTCTTCAG ACCGGCGACG GCGAAGCTGA TATCGACGAC GTCCGGTGGC TCCTGAATAA CCGCGACAGC GAGGCAGTCA AGACGAACCC GGACGCCTAC GACTTCGACG GTGACGGGAC CGTCGGCGTC GGCGACATCC TGGCACTATT CAGATCGATC TACTGA
|
Protein sequence | MIDEEKTRNA EHDDTRSIDR PTGDKHTLST PSRRRFLQAT AGTGIAVASG GLVGHAAADS HTSALPPLRR NGNQIETPDG ETIELHGVNI ADPKRVDVTA SVRGKDAAQA IDLATDPSQG WHADVVRLPV QPVDIGEHQA GAGPEPPAFD ESQLQDYIEN HLDPAVQQCA ANGAYAIIDY HRHRDIPWTD DGLSQEVTMF WDIVAQRYGE MDHVIFEVYN EPQGNPNYGV SGQELVDFWG EWKATAQPWV DTVREYTENL VLVGSPRWSQ MTFGAVIEEF DGANIGYTLH LYPGHGPTTP GDYDDFVTPV NNGGGDVPYD DETPAWEVAP VFMTEWGFDY DADPAAGGGV NEDTAAGADW AEHDSDFGHH VTEWLSTRPV HSTAWVFDVL WDPNMFTRGF DVPDDEGWAP YTGGEIPEYV TDRPAEWVLQ TGDGEADIDD VRWLLNNRDS EAVKTNPDAY DFDGDGTVGV GDILALFRSI Y
|
| |