Gene Information |
Locus tag | Huta_0675 |
Symbol | |
ID | 8382944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 666467 |
End bp | 667546 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644971738 |
Product | peptidase M42 family protein |
Protein accession | YP_003129594 |
Protein GI | 257051761 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
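The Start bp, End bp, Gene Length, and Protein Length rows above are tied together by simple arithmetic. The sketch below shows that check in Python, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(666467, 667546)       # values from the record above
    print(length)                              # 1080
    print(expected_protein_length(length))     # 359
```

With the values in this record, both results agree with the Gene Length (1080 bp) and Protein Length (359 aa) rows.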
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.873369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGACC CACAGCGGGA CTTTCTCGAA TCGCTCCTCG ACACTGCCAG CCCATCGGGC TTTGAAACGC CCGGCCAGCG CGTCTGGATC GAGTACGTCT CACAGTTCGC CGACAACGTC AGGACTGATG ACTACGGCAA CGCCGTCGCG GTCCACGAGG GCGACGGGGA TCGCGAGATC GCGATCGCCG GCCACGGTGA CGAGATCGGA TTCATGGTCC GGGACATCAC GGACGATGGA TTCATCGAAC TGTCCCGGAT CGGCGGGTCG GATCGGACGG TCACTCGCGG CCAACACGTC ACTGTTCACA CCGATGCGGG ACCTGTCTCC GGCGTCGTCG GCCAGACGGC GATCCACCTG CGTGATCGTG AGGACGACAG TATCGACGAC GTAGCCGAAC AACACGTCGA CATCGGCGTC TCCGACGGCG AGTCCGCTCG CGAGCGCGTC GAGATCGGCG ATCCAGTGAC ATTTGCCTCG GGCCTCGAAT CGCTCGCGGG GACGCGGCTG TCGGCCCGCG GCATGGACAA CCGCGTGGGG ATCTGGACGG CCGCCGAAGC GCTCCGAACG GCAAGCGACG CCGACGCCTC GGCGACGGTC TACGCCGTCA GCACCGTCCA GGAGGAACTC GGACTCCAGG GCGCGAAGAT GGTCGGGTTC GATCTCGATC CCGACGCCGT GGTGGCGGTG GACGTCACGC ACGCGACTGA TACGCCCGAC GTGCCGGGAA AACGATCGAA CGGCGTCGAA TTAGGGGCCG GCCCGGTCGT CGCTCGCGGG AGCGCGAACC ACCCGCAGCT TGTCGAGTCA CTTCGCTCCG TGGCCGACGA GGAGAACGTC GACGTCCAGC TCGAAGCGAC GGGCATCCGG ACCGGAACTG ACGCCGACGC GTTCTACACC CAGCGTGGGG GCGTCCCGTC GGTGAACCTG GGGCTGCCGA ACCGCTACAT GCACACGCCC GTCGAAGTGA TCGACACCGC GGACCTGACG AACGCGGCCA CCCTCCTCGG GGCCTTCGCC GTCGCCGCCG AGTCGCTTGC GCCCTTCGGC GTCGAATTCG ACACTGAGTC GAGGGCCTAG
|
Protein sequence | MDDPQRDFLE SLLDTASPSG FETPGQRVWI EYVSQFADNV RTDDYGNAVA VHEGDGDREI AIAGHGDEIG FMVRDITDDG FIELSRIGGS DRTVTRGQHV TVHTDAGPVS GVVGQTAIHL RDREDDSIDD VAEQHVDIGV SDGESARERV EIGDPVTFAS GLESLAGTRL SARGMDNRVG IWTAAEALRT ASDADASATV YAVSTVQEEL GLQGAKMVGF DLDPDAVVAV DVTHATDTPD VPGKRSNGVE LGAGPVVARG SANHPQLVES LRSVADEENV DVQLEATGIR TGTDADAFYT QRGGVPSVNL GLPNRYMHTP VEVIDTADLT NAATLLGAFA VAAESLAPFG VEFDTESRA
|
| |
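The gene sequence above is shown in space-separated blocks of ten bases. It starts with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene is on the minus strand. The Python sketch below strips the block spacing, computes a GC fraction, and translates the sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two are understood to differ in permitted start codons, not in these assignments). All names are hypothetical helpers written for this illustration.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    head = "ATGGACGACC CACAGCGGGA CTTTCTCGAA"
    print(translate(head))  # MDDPQRDFLE
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full Gene sequence field to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein sequence rows.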