Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1657 |
Symbol | |
ID | 3747675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2154547 |
End bp | 2156325 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637774195 |
Product | beta-N-acetylglucosaminidase |
Protein accession | YP_379952 |
Protein GI | 78189614 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGTT ATCATTCTCT CTTATTTCCC TTTTTTTCAT TGAAAAAAAA ATGGAGCCGC AAAAGCCGCC GACTCTTCAT GCTTTTTATG GTACTTACGT TGAGCGTTGG CAACCATGCA AGCGCTCGTG TAGGGGGTGA ATCGTGGCGT GCGGAGCAAA TTTTTAGCAA GCAGAGCGAT GAGCTTGAAG AGCAATTGCA CGCCATGAGC TTAGCCGACA AAGTTGGGCA AATGATTATT GCCGACATTG AGCCAACAGC TTTTTCTCCC CGCAATAAAA AGGTGCTGCT GCTGAGCCGT TTAGCGCAAG AAGGCAAAAT TGGCGGGGTT ATTTTTATGA AAGGCGATGC CAAAAGCACG GGCGCCGTAG TGAATCACTT GCAAGCTTTA GCGCCGTTGC CACTGCTTTT TAGCTCCGAT ATGGAACGAG GAGTTGCCAT GCGCATTAGT GGCACCACCG AGTTTCCGCC AAACATGGCG CTTGGCGCTA CCGCTGATCC AAAGTTAGCT GAAGACATGG CAACGGCTAT TGCTCAAGAA GCCACCTTGC TTGGAATGCA CCACAACTAC GCGCCAACGG TTGACCTTAA TAGTAATCCC CGCAACCCCG TTATTAACAC GCGTGCTTTT GGCGATACCA TTCCGCTTAC CATTGTAATG GCAAATGCCA TTATTAAGGG ATTGCAATCG CACGGCGTAC TTGCCACCGC GAAGCATTTT CCCGGACATG GCAACGTTAC GGTGGATAGC CATGTGGCGC TTCCCGTATT ACAAGCTACT CGTGAGCAAC TTGAGGCTTA CGAGCTTATT CCTTTTCGTG CAGCAATTGA GCAAGGCGTG GCAACCATTA TGGTGGGGCA TCTTGCCGTG CCAGCACTAA CGGGCAACAT GGAGCCTGCA ACCATTTCAC CTGCCATTGT AACCACGTTG TTGCGTCAAG AACTTGGCTT TAAAGGGCTA ATTATTACCG ATGCGCTTAA CATGAAGGCG CTTTATAACG GCAGCAACGT TGCCACTCTT TCGGTGCGAG CTGTGCAAGC AGGGAACGAC TTGCTGCTTT TTTCGCCCGA CCCCGAAGCT ACCCATAGCG CAGTTGTGCA AGCCGTTGAA GCGGGGCAAA TTCCCCTTGA GCAAATTAAC GCTTCGGTGC GACGCATTTT GCAAGCCAAA CAATGGCTAA AGCTTGAAAA GCATCGCGAG GTAGATAGTG AGGATATTGA AGAGGATGCC AATCCAGCAA GCCATCGCGA ACTTGCCCGC AAAATTGCTG AACATGCCGT AACGTTAGTG AGCGATGTGG AACGCAATGT GCCGCTTAAA AAGAGTGAGC AGCTCCTTCA CCTTATTGTA CAAGATCGGG TGAATTACCA AACAGGGCGC AATTACCTTC GCCAACTCAG CGAACGCTAT CCCACCATAA CTCATCTACG CATTAACCCT AAAAGCGATG CGCTTGATTA TGCTATTGCC ACCGAACTTG CCATGAACGC CTCAAGCGTG CTTGTAACAT CTTACGTGCA ATCGCTAAGT AGCAATGGCG AACTCAAACT TACTGCCGAA CAGCAAAACT TTTTGCACTT ATTACCGACG GTGGTTCAGC GTGGTACGCC CATGGTGTTG CTGTCGCTTG GCACGCCGTA TATTAGCAAC TATTTTCCAG AGTTTACAAG CTATCTTTGC ACCTACTCGT TTGACGAGGA GAGCGAACGT GCGGCTCTGC AAGTGTTGCA GGGCGAGCTT ACGCCTCGTG GTGTGCTGCC TATTGTGCTT GGGCAGTAG
|
Protein sequence | MSSYHSLLFP FFSLKKKWSR KSRRLFMLFM VLTLSVGNHA SARVGGESWR AEQIFSKQSD ELEEQLHAMS LADKVGQMII ADIEPTAFSP RNKKVLLLSR LAQEGKIGGV IFMKGDAKST GAVVNHLQAL APLPLLFSSD MERGVAMRIS GTTEFPPNMA LGATADPKLA EDMATAIAQE ATLLGMHHNY APTVDLNSNP RNPVINTRAF GDTIPLTIVM ANAIIKGLQS HGVLATAKHF PGHGNVTVDS HVALPVLQAT REQLEAYELI PFRAAIEQGV ATIMVGHLAV PALTGNMEPA TISPAIVTTL LRQELGFKGL IITDALNMKA LYNGSNVATL SVRAVQAGND LLLFSPDPEA THSAVVQAVE AGQIPLEQIN ASVRRILQAK QWLKLEKHRE VDSEDIEEDA NPASHRELAR KIAEHAVTLV SDVERNVPLK KSEQLLHLIV QDRVNYQTGR NYLRQLSERY PTITHLRINP KSDALDYAIA TELAMNASSV LVTSYVQSLS SNGELKLTAE QQNFLHLLPT VVQRGTPMVL LSLGTPYISN YFPEFTSYLC TYSFDEESER AALQVLQGEL TPRGVLPIVL GQ
|
| |