Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1972 |
Symbol | |
ID | 3747834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2503979 |
End bp | 2505103 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637774508 |
Product | glycosy hydrolase family protein |
Protein accession | YP_380263 |
Protein GI | 78189925 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACGGC GTTTGCGGAT GGTTGCGTTG GTGCTGCTAT GGCTTTGCAT GGCAGCACCT CTCGCTATGG CGGCAGCTTC GCCCGATAGC CTTTCCATAA AAATGGGGCA AATGGTGATG GTTGGCTTTC GTGGCACCTC GCTTGCAGAG TCGCCCCAAA TTGTTGCTGC TATTCAAAAG CGCCACATTG GCGGCGTGGT GCTTTTTGAC TACGATGTTC CCTTTGCCTC CCCTACTCGC AACATTACAA GTCCCTCGCA ACTTGCTCGT TTAACGCAAG AGTTGCAAGA GCATTCAGCA ATTCCACTCT TTATTGCTAT TGATCAAGAA GGGGGAAAGG TCAACCGTCT TAAAGCCTCA CGCGGTTTTC CCGTAACAAT ATCAGCCGCA AAGTTAGGAG CCTTAAACCA GCCCGACTCG ACCCGATCCG CTGCTCGTCA AATAGCAAAA ACGCTCAGGG CAATGCACGT TAACATGAAT TTTGCGCCCG TTGCCGATCT CAATCGTAAT CCCAATAATC CCGTTATTGG CAAGGTAGAG CGAAGCTTCT CAGCCGATGC TGCACGAGCC ACCACCCATA TTCGCTTAAC GGCTGATACC TATCGTGCGG AAGGCATTAT TCCGACGCTC AAGCATTTTC CGGGACATGG CAGCTCCACC ACCGATACCC ATCTTGATTT TACCGATGTA AGTAATTCGT GGAGCAAGGA AGAGCTGGAA CCGTATCGCT CCTTAATTGC CGACGGCTAT GAGGATGCTA TTATGACGGC GCATGTTTTT AATGCTACGC TTGACCCAAC CTATCCCGCC ACGCTTTCAA AGCCAACGCT TGATGGGGTG TTGCGCCAAC AGTTAGGCTT TAAGGGAGTT ATTATTAGCG ATGATATGCA GATGGGAGCT ATTGCAGCGC ACTATGGGCT TGAAAGTGCT ATTCGCCTTG CTCTCAACGC TGGAGTTGAT ATTCTTTTGT TTGGCAACAA TACCGCTTAC GATGAAGCTA TTGCCGAAAA AGCTCTTGCA ATTATTCATG CACTTATTGA ACGTGGAGAA ATTCAGCCAA GCCGTATTGA AGAATCGTAT CGCCGCATTA TGGCACTAAA GCAGAGGTAT GGGGTGGTGA GGTAA
|
Protein sequence | MRRRLRMVAL VLLWLCMAAP LAMAAASPDS LSIKMGQMVM VGFRGTSLAE SPQIVAAIQK RHIGGVVLFD YDVPFASPTR NITSPSQLAR LTQELQEHSA IPLFIAIDQE GGKVNRLKAS RGFPVTISAA KLGALNQPDS TRSAARQIAK TLRAMHVNMN FAPVADLNRN PNNPVIGKVE RSFSADAARA TTHIRLTADT YRAEGIIPTL KHFPGHGSST TDTHLDFTDV SNSWSKEELE PYRSLIADGY EDAIMTAHVF NATLDPTYPA TLSKPTLDGV LRQQLGFKGV IISDDMQMGA IAAHYGLESA IRLALNAGVD ILLFGNNTAY DEAIAEKALA IIHALIERGE IQPSRIEESY RRIMALKQRY GVVR
|
| |