Gene Information |
Locus tag | Cpha266_2499 |
Symbol | |
ID | 4568552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2865580 |
End bp | 2867349 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639767059 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_912911 |
Protein GI | 119358267 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
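The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the record does not state either convention explicitly. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2865580
END_BP = 2867349


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the terminal stop codon encodes no residue


computed_bp = gene_length(START_BP, END_BP)
computed_aa = protein_length(computed_bp)
print(computed_bp, computed_aa)  # 1770 589
```

Under those assumptions the computed values agree with the Gene Length (1770 bp) and Protein Length (589 aa) fields.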
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.964917 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAA CACCGATACG CACGATTGCC ATCGCCATCC TCTTCCTCAT AGCCGCAGGA TCGATTTCTG CCGACGCCTT TGGAAAATCA AAGTCGAAAT CAAAGCCATC ATTCAGCAAA TGGCAGGCCC AGTCTATTTT TTCCTTCAGC GACTCGAAAA CAGAAAAAAC GCTGAAACAA ATGACCCTTT CGGAAAAAAT CGGTCAGATG ATTATTGCCC AGACCGAAGC GCGATCCGGA ATTACCACTG ACAGAGCAAC TCAACAGCTC GGCAGACTGG TACAGGAAGG CAAAGTCGGG GGCATAATGT TTATGAAAGG CGACGCCTTC AGCGCTGCAC TGCTCTCTAA CTACTTTCAG TCACTGACCG CGCGCCCGCT GCTCATGAGC GCCGATATGG AACGAGGACT TGCCATGAGG CTCAGTGGGG CAACCGAATT CCCCCCCAAT ATGGCTCTTG CCGCCACAAA AGAGACCAAA TTTGCCTTTG AAATGGCAAA AGCTATTGCA AAAGAGGCCC GGATTGTCGG GATACACCAG AACTATGCCC CAACCGTTGA CCTGAACATC AATCCGGCCA ACCCCATCAT CAACACCCGC TCCTTCAGCG ACAACCCTGC ACTGGCCATT GCCATGTCAA ACGCCGTTAT CGAAGGTCTC CAATCAAACG GAATTGCCGC AACGGCAAAA CACTTTCCGG GTCATGGCGA CGTCACCGTT GACAGTCACC TTTCGCTGCC AGTGCTGAAT GCCGACAGAG CCCGCCTCGA CGCCTATGAA CTCCAGCCGT TCAAAGCGGC TATCGACCAG GGAATCATCA GTATCATGAC CGGTCATCTT GCCGTACCGA AACTCACCGG CACCATGGAA CCGGCATCAA TTTCAAAAAC CATTGTTACC GATCTTCTTC GCAAAGATCT GGGCTTCACG GGATTGATCA TTACCGATGC CATGAACATG AAAGCGCTCT ACAACGGAAA CAACGTTGCC GAAATATCAG TAAAAGCCGT TCAGGCCGGC AACGACCTGC TTCTGTTCTC CCCCGATCCC GAACTGGCTC ACAACGCGAT TCTTAATGCC GTCGAAAACG GAGTAATCCC GAGAGAAAAT ATCGACGCCT CTGTCCGACG AATTCTGCAA CTCAAGCATT GGCTGGAAAT AGAACACAGA AAACTCGTTG ACCTCAATTC CGTTATGGAC AACATAAGTC CATCTGCGCA CCGCGACCTT GCCGAAAAAA TCACCCGGAA CTCCATAACG ATTGCTCAGA ATGCCAATAA CGTTATTCCT TTGAAAATCG GATCTTCCTC AGGCAACATC CTGAGCATTA TCCTGCAAGA CAAATCAAAC AGCGAAACCG GCAAACACTA TATCGATGAA ATCAACCGAT ACTATCCTGC CTCCCATCTG AGAATAGACC CGAAAAGCGA TGACCAGACC TTTGCTGCCG CTCTTGAATT AGCCTCGAAA GCACCGGCAG TTGTTATATC CTCTTACGTA CAGGTCTTCT CCGGTTCCGG AACCCTGAAG CTCACCCTGA AACAACAGGA ATTCATCCAC AAACTCGCAC AATCGCTTCC CGCAGGCAAA CCGCTGATCT TCATCTCTTT CGGCACGCCC TATCTGATCA ATGCCTTTCC TGAAATACAT GCCCACCTGT GCGCATACGC AGCAAACGAA ACAAGTGAAA CCTATGCAGT CAAGGCACTA CGGGGAGAGC TTAGCCCAAC GGGAACGCTG CCGGTATCGC TGCAGAGAAA CAGCCGATAA
|
Protein sequence | MNKTPIRTIA IAILFLIAAG SISADAFGKS KSKSKPSFSK WQAQSIFSFS DSKTEKTLKQ MTLSEKIGQM IIAQTEARSG ITTDRATQQL GRLVQEGKVG GIMFMKGDAF SAALLSNYFQ SLTARPLLMS ADMERGLAMR LSGATEFPPN MALAATKETK FAFEMAKAIA KEARIVGIHQ NYAPTVDLNI NPANPIINTR SFSDNPALAI AMSNAVIEGL QSNGIAATAK HFPGHGDVTV DSHLSLPVLN ADRARLDAYE LQPFKAAIDQ GIISIMTGHL AVPKLTGTME PASISKTIVT DLLRKDLGFT GLIITDAMNM KALYNGNNVA EISVKAVQAG NDLLLFSPDP ELAHNAILNA VENGVIPREN IDASVRRILQ LKHWLEIEHR KLVDLNSVMD NISPSAHRDL AEKITRNSIT IAQNANNVIP LKIGSSSGNI LSIILQDKSN SETGKHYIDE INRYYPASHL RIDPKSDDQT FAAALELASK APAVVISSYV QVFSGSGTLK LTLKQQEFIH KLAQSLPAGK PLIFISFGTP YLINAFPEIH AHLCAYAANE TSETYAVKAL RGELSPTGTL PVSLQRNSR