Gene Information |
Locus tag | Nmag_2478 |
Symbol | |
ID | 8825331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 2538365 |
End bp | 2539444 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | Cellulase |
Protein accession | YP_003480600 |
Protein GI | 289582134 |
COG category | |
COG ID | |
TIGRFAM ID | |
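
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2538365, 2539444)
print(length, protein_length(length))  # prints: 1080 359
```

Under these assumptions the result agrees with the listed Gene Length (1080 bp) and Protein Length (359 aa).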
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCTCA CCGACGATAC CGTGCCGTTC GATATCGACC TTTTGACAGA ACTCACCGAG ACCAGCGGTG TTCCTGGATA CGAAGACCGC ATCCGCGACC TGGTCGTCGC AGAACTCGAG TCCAGTGTCG ACGAGGTCCG GACCGACGCG ATGGGGAACG TCGTCGGGAC ACTCGAGGGC GACTCCGACT ACTCCGTTGC CGTCGCTGCA CACATGGACG AAATCGGCTT TATGGTTCGC CACGTCCGCG GGAAGGAAGG CGAGGACGGC TTCGTCGAAC TCGATCCGCT CGGCGGCTGG GACGCACGTG TTCTCAAGGC CCAGCGTGTT ACGATCCACA CCGACGACGG CGACCTACCG GGCGTCATCG GTTCGCCGCC GCCACACACA CTGGACGAGG AAGACCGCAA GAAGACGCCG GAGGTCGAGG ACACCTACGT CGATGTCGGT CTTTCCTACG AGGATGCCAA CGAGCGCATC TCACCCGGCG ACCTCGTGAC GATGGACCAG TCGACCGAAC TCGTCGGCGA AACGGTCACC GGCAAGGCAC TCGACGACCG CATCTGCCTG TTCGCGATGC TCGAAGCCGC CCGCCGACTC GAGACCCCCG ACGTGACGAT CCACTTCTGT GCGACCGTCC AGGAGGAAGT CGGCCTGCGC GGGGCAAACG CACTCGGCGT CGACGTCGAT CCCGACCTCG CTATCGCCCT CGACGTCACC GTCGCGAACG ACGTACCCGG CTTCGAGGAC GGAGAGCACG TCACCAAACT CGGGGAGGGG ACCGCGATTA AACTCAAGGA CTCGAGTGTC ATCACGAATC CGAAGGTCCA CCGTCGACTC CAGTCGGTCG CCGACGAGGA GGGAATCGAG TCTCAACTCG AGATCCTTCC GGCAGGCGGC ACCGACACGG CTGGGTTCCA GAATACGGCA GGTGCGAAGC CGGTGGGTGC GATTTCGATT CCGACGCGAT ACCTGCATAC GGTGACGGAG ACGGCCCACG TCGAGGACGT GGCGTCGACG ATCGATCTGC TCGAGGCGTT CCTCGCGAGC GAGGACGGCG AGCACGACTA CACGTTGTAG
Protein sequence | MSLTDDTVPF DIDLLTELTE TSGVPGYEDR IRDLVVAELE SSVDEVRTDA MGNVVGTLEG DSDYSVAVAA HMDEIGFMVR HVRGKEGEDG FVELDPLGGW DARVLKAQRV TIHTDDGDLP GVIGSPPPHT LDEEDRKKTP EVEDTYVDVG LSYEDANERI SPGDLVTMDQ STELVGETVT GKALDDRICL FAMLEAARRL ETPDVTIHFC ATVQEEVGLR GANALGVDVD PDLAIALDVT VANDVPGFED GEHVTKLGEG TAIKLKDSSV ITNPKVHRRL QSVADEEGIE SQLEILPAGG TDTAGFQNTA GAKPVGAISI PTRYLHTVTE TAHVEDVAST IDLLEAFLAS EDGEHDYTL
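
The relationship between the gene sequence and the protein sequence can be sketched in Python as follows. This is an illustrative sketch, not the database's own method: it assumes the sequence is given as space-separated blocks as shown above, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ only in permitted start codons). The names `clean`, `translate`, and `gc_content` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard code, shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First 21 bases of the gene sequence listed in this record.
print(translate("ATGTCTCTCA CCGACGATAC C"))  # prints: MSLTDDT
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the Protein sequence and GC content fields.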