Gene Information |
Locus tag | Cpha266_2578 |
Symbol | |
ID | 4569093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2953213 |
End bp | 2954340 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639767143 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_912990 |
Protein GI | 119358346 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
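The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2953213
end_bp = 2954340


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count, assuming an unspliced CDS whose final codon is a stop."""
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1128 375
```

Both values match the reported Gene Length (1128 bp) and Protein Length (375 aa).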
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA ACCGATTCCC GCTCTCACTG CTGATCATGC TCTTGATGCT TCAGACCCCG GCTGCCTGGG CGGCAAAAGA GCCTGACAGT CTGGGAATCA AAATCGGCCA GATGATTATG ACCGGATTCA GAGGTTGCTC TCTTGCGGAA TCGCCGCAAA TTGCATCGGA TATCCGGCGG CAACGAATAG GCGGGGTCGT ACTCTTCGAC TACGACGTTC CATCCCGCTC GCCCATCCGT AACATCACAA CGCCCTCCCG GCTCATGAAA CTGACCAGAG AGCTTCAGGG AATAACGGAA ATTCCGCTCC TTATCGCCAT CGACCAGGAG GGAGGGCGGG TAAACCGCCT CAAACCCGCT CTCGGCTTTC CCCCGTCGCT CTCGGCCGCC CGGCTCGGAA AACTCGACAA TACCGACAGC ACAACCGCAG AGGCAGCCAA AACAGCGGAA ACGCTGAAAA CCATGCACCT GTCGATGAAC CTCGCCCCGG TCGTCGATCT CAACAGCAAC AAAGAGAACC CTGTCATCGG CAAACTTCAG AGAAGTTTTT CCGACGACCC GGACGTCGTC ACAAGAAACG CCCGGGCCAC CTGCAACGCA TTCCGCGAAA AAGGAATCAT TGCGACCCTC AAACACTTTC CCGGCCACGG CAGCTCAACC ACCGATACCC ACAAAGGATT TACCGACATT ACCGGCACCT GGCGCGAAAA CGAGCTCCAG CCATACCGTC AGCTCATAGC CGGAGGGTAC AACGACGCCA TCATGACCGC ACACGTCTAC AACGCAACGA TCGACAGCCT CTACCCCGCA ACGCTCTCAA AAAAAACACT CAAAGGAATC CTTCGTGAAA AACTCGGCTT CAGAGGGGTA ATCATCACCG ACGACATGCA GATGAAAGCG ATTGCCGACC ATTACGGACT CGAAGAGGCT CTCCGTCTTG CCATCGAAGC CGATGCCGAC ATCCTGCTGT TCGGCAACAA CACAACCTTT GACCCCGACA TCGCCAGAAA AGCCATTGCC ATCATCAGAA CGATGGTCAG TAAAAAAATC ATCACCACGG ACCGAATCGA CCGCTCCTAT CGAAGAATCA TGACGCTCAA AGAACGATAC CTCTTTCAAT GCAAATGA
|
Protein sequence | MKRNRFPLSL LIMLLMLQTP AAWAAKEPDS LGIKIGQMIM TGFRGCSLAE SPQIASDIRR QRIGGVVLFD YDVPSRSPIR NITTPSRLMK LTRELQGITE IPLLIAIDQE GGRVNRLKPA LGFPPSLSAA RLGKLDNTDS TTAEAAKTAE TLKTMHLSMN LAPVVDLNSN KENPVIGKLQ RSFSDDPDVV TRNARATCNA FREKGIIATL KHFPGHGSST TDTHKGFTDI TGTWRENELQ PYRQLIAGGY NDAIMTAHVY NATIDSLYPA TLSKKTLKGI LREKLGFRGV IITDDMQMKA IADHYGLEEA LRLAIEADAD ILLFGNNTTF DPDIARKAIA IIRTMVSKKI ITTDRIDRSY RRIMTLKERY LFQCK
|
| |
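The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: it assumes the standard codon assignments (which translation table 11 shares for elongation codons; the tables differ in permitted start codons), and it uses only the first 30 bases of the gene sequence as a sample. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Build the standard codon table in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the 10-base block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above (a sample only).
first_30 = "ATGAAAAGAA ACCGATTCCC GCTCTCACTG"
print(translate(first_30))  # MKRNRFPLSL
```

The sample output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.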