Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_0013 |
Symbol | nagA |
ID | 4186894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | - |
Start bp | 11919 |
End bp | 14891 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638070011 |
Product | b-glucosidase |
Protein accession | YP_676647 |
Protein GI | 110636440 |
COG category | [G] Carbohydrate transport and metabolism [V] Defense mechanisms |
COG ID | [COG1472] Beta-glucosidase-related glycosidases [COG1680] Beta-lactamase class C and other penicillin binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAAAA AGATTTTATT ACAAGCAGGT ATACTGCTTT CACTTATATT GTTTTTTGTT GCTGGCGCGG CTACCTGGGA AAAGCCCGAA CCGTTCAAAG ATACCCTTGC AGAAGCATGG GCAGACAGCG TTATGTCTAC ACTCACTGAT GAACAGGCAC TGGGGCAACT GTTTATGGTT GCTGCGTATT CAAATAAACC TGAAGCCCAT GCACTGGAAA TTGAACAACT TATAAAAAAT AATGGTATTG GCGGTGTGAT TTTTTTTCAG GGCGGTCCGG TAAGGCAGGC CCAGTTAACC AACAGATATC AATCGGTAAG TAACGTGCCG CTGTTTGTTG CCATGGATGC GGAATGGGGG TTGGGCATGC GGCTGGACAG CACCATGAAT TTTCCCAAAC AAATGACGCT GGGTGCTATT CAGGATAATG CAGCTATTTT TAACATGGGC GTTGAAATCG GCAAACAATG TAAACGGCTT GGCGTACATA TAAATTTTGC ACCTGTTGTT GATGTTAACA GCAATGCAAA CAATCCGGTA ATTGGCGTGC GCTCGTTTGG TGAAGATAAA ATCAATGTAA GCCAGAAAGC AATCGCCTAT ATGAAAGGAA TGCAGTCTGT GCATGTGATG GCAAATGCAA AACATTTTCC GGGCCACGGA AATACAGATA CCGATTCACA TTTCTCTTTA CCGGTTGTTA ATAAAAATGT TCAGGAATTG AATGACACAG AGCTGTATCC TTTCAGACAG CTGATCGATT CAGGTGTTGG AAGCATTATT GTTGCTCACA TGAATGTACC TTCTTTAGAC AATACAAACA AACCGGCAAC GTTGTCCAAA CCAATTGTTA CGGATCTGTT ACGGAATGAT ATGGGCTTCA GGGGGCTCAT CTTTACAGAT GCGCTTAATA TGAAAGGAGT CAGCAATCTG TATAAACCGG GAGAAGTTGA TGTAAAGGCC CTGCTTGCGG GAAATGATAT CCTGTTATAT GCTGAAAATG TTCCCTTAGC CATAAAGAAA ATAGTAAAAG CAATTAATGA TAAGGATATA ACAAAAGAAG AAATTCATGC ACGCGTAAAA AAAATATTGT TGGCTAAATA TTGGGCCGGA CTAAATCATT TCAAAAAAAT AGAAACAGAA AATCTATATC AGGATCTGAA CAACGCCTCT GCAAAGGCTC TTTTAAATAA CTTATATAAA CAGTCACTTA CTGTTGCCCG GAATAAAAAC AACATTCTTC CATTTGTACT GGCAGATACA ACTTCTTTTG CGTCTGTAAG TATTTCATTT CATGGTTCTG AAAATATTTT TCAGCAAACC CTTAGTAACT ATGCTGCTTT TGATCACTAC TCCATAGAAA AATCGGGCAG CGACACGGCT CTGGTTTCTT TGGTAACTAA ACTGAAAAAA TATGAAGCGG TTATTGTTGG TGTACATCAG GTAAATTCTT ATAACTCTAA AAACTATGGA ATAAGTACTG CTACCAAAAC ATTTATACAG CAATTACAGG CCGCACATCC CAATGTTACG GTTGTGGTGT TTGGTATTCC GTATGCGTTA AAATATTTTT CAGATTCGAA AGTGCTGGTT TGTGCCAACG AAGACAATGC GTACACACAG CGCCTGGTGC CGCAATTGTT ATTCGGTGCG ATTCAGGTAA ACGGACGGTT GCCGGTAACC GCCGGTGGAA ATTTAAAATT GAATACCGGA CTTCCTGTTT CATATAATTG CATGCGTATG CGGTATGACT TGCCGGAGAA TTTACGTATG GACAGCAAAA CACTAAGTAA AATAGATACA ATTGTAATGC ACGCCATCAC AGAAGGCGCT ATGCCGGGCT GTCAGGTATT GGTCGCTAAA AAAGGCGCTG TGATTTATAA TAAATCATTT GGTTATTACA CCTATGACAA AAAGAATCCG GTAACATCCA CTACCTTGTA TGATATTGCT TCTATAAGTA AAGTTGCAGG AACATTACAG GCCATTATGT TCTTAGAAGA GCGCGGCCTG ATAAAATTGA ACTATAAAAT TTCTGTGTAC CTGCCCGACC TGATCGGCAC GAATAAAGAA GACCTGATCA TTCGGGATAT TCTAACGCAC CAGGCAGGCC TTCAGCCATT TTTGCCGCAC TGGCGAAAAA CAATGGATAG TTCAAACTTC AGTAAAAAAT ATTACAGCAC CATTAAAAGC GACAGCTTCC CGAACATGGT TATACCGGGG CTTTACAGCA TAGCGGGTAT TGAAGATTCC TTGTGGAAAT GGACCGTGCA GTCATCGTTA ATGGCACATC CCAAACAAGG GAAAAAGAAA TTGCCTTATT CCTATGTGTA TAGTGATCTT GGTTTTTATA TCATGAAGCG TTTGGCTGAA AGCCAGCTGG GACAGCCGAT GGAAGAGTTT TTAAAACAGA ACTTCTATGA TCCGTTAGGT CTGCAGAACT TCTATTATAA TCCGCTGGAA AATGGTGTAC CGGCTTCGCG GATCACACCA ACCGAACAGG ATAAGTATTT CAGAAAATCA CTGGTTGTGG GTACGGTGCA TGATCCGGGT GCGGCTTTAT TAGGGGGCAT TGGCGGGCAT GCAGGTATTT TCTGCAATGC CGAAGATCTG GCAACACTGA TGCAGATGAA TCTTCAGCTT GGCTATTATG GCGGTTATCG CTTTTTACTG CCTGAAACGA TCGAATTATT CAGCAAAACC CAATCAACAA AAAACAGACG TGGCTTAGGT TGGGATAAAC CCCAGGCAAG CAGCGGCGGA CCATGTTCTT ATTTAGTTTC TGCTTCCACC TATGGCCATA CCGGATTTAC GGGTACCTGT GCCTGGGTTG ACCCTGAACA GGAATTAGTT TATATTTTCC TTTCAAACCG GGTATACCCG GATGCGAACA ACACTAAGCT TATAAAGGAA AGTATTCGGA CTCAAATTCA AACGGTTATC TATAAATCCC TTCTTAATTT CAGAGAACAA TAA
|
Protein sequence | MRKKILLQAG ILLSLILFFV AGAATWEKPE PFKDTLAEAW ADSVMSTLTD EQALGQLFMV AAYSNKPEAH ALEIEQLIKN NGIGGVIFFQ GGPVRQAQLT NRYQSVSNVP LFVAMDAEWG LGMRLDSTMN FPKQMTLGAI QDNAAIFNMG VEIGKQCKRL GVHINFAPVV DVNSNANNPV IGVRSFGEDK INVSQKAIAY MKGMQSVHVM ANAKHFPGHG NTDTDSHFSL PVVNKNVQEL NDTELYPFRQ LIDSGVGSII VAHMNVPSLD NTNKPATLSK PIVTDLLRND MGFRGLIFTD ALNMKGVSNL YKPGEVDVKA LLAGNDILLY AENVPLAIKK IVKAINDKDI TKEEIHARVK KILLAKYWAG LNHFKKIETE NLYQDLNNAS AKALLNNLYK QSLTVARNKN NILPFVLADT TSFASVSISF HGSENIFQQT LSNYAAFDHY SIEKSGSDTA LVSLVTKLKK YEAVIVGVHQ VNSYNSKNYG ISTATKTFIQ QLQAAHPNVT VVVFGIPYAL KYFSDSKVLV CANEDNAYTQ RLVPQLLFGA IQVNGRLPVT AGGNLKLNTG LPVSYNCMRM RYDLPENLRM DSKTLSKIDT IVMHAITEGA MPGCQVLVAK KGAVIYNKSF GYYTYDKKNP VTSTTLYDIA SISKVAGTLQ AIMFLEERGL IKLNYKISVY LPDLIGTNKE DLIIRDILTH QAGLQPFLPH WRKTMDSSNF SKKYYSTIKS DSFPNMVIPG LYSIAGIEDS LWKWTVQSSL MAHPKQGKKK LPYSYVYSDL GFYIMKRLAE SQLGQPMEEF LKQNFYDPLG LQNFYYNPLE NGVPASRITP TEQDKYFRKS LVVGTVHDPG AALLGGIGGH AGIFCNAEDL ATLMQMNLQL GYYGGYRFLL PETIELFSKT QSTKNRRGLG WDKPQASSGG PCSYLVSAST YGHTGFTGTC AWVDPEQELV YIFLSNRVYP DANNTKLIKE SIRTQIQTVI YKSLLNFREQ
|
| |