Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2781 |
Symbol | |
ID | 7269851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 3418308 |
End bp | 3419678 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643567602 |
Product | beta-galactosidase |
Protein accession | YP_002464080 |
Protein GI | 219849647 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCATG CAATCCGCTT TCCGACGAAC TTTATCTGGG GCGCAGCCAC CGCTGCCTAT CAGATTGAAG GTGCCTGGAA CGAAGATGGA AAAGGCGAGA GTATTTGGGA TCGCTTTGTC CGCCGACCCG GTGCCATTGC CGATGGTAGT ACCGGTGATG TCGCCTGTGA CCACTATCAC CGGTACGAAG AAGACCTCGA ACATATGGCA GCAATGGGAC TGAAGGCGTA CCGTTTCAGC ATCGCATGGC CGCGCATCTT CCCCGACGGC ACCGGCCAAC CCAATCAGCG CGGGCTTGAT TTTTATCGCC GACTCATTGA CGGTTTGCAC CGACGTAGGA TTCTCCCGGT TGCAACTCTC TACCATTGGG ATCTGCCGCA AGCAATTGAA GATCGCGGCG GCTGGATCAA CCGAGATACG GCTTTTTATT TTGCCGAATA CGCCGATTAT CTCTTTCGCC AGATCGGTGG CGATGTTGCG CTCTGGGCTA CCCACAACGA GCCATTTATA CAGGCCTTCT ACGGCTACGG CAATGGTGAA AATGCGCCCG GTAAGCGAGT GCCGTGGCGA GTATTGCACG TCGTACACCA TCTTTTGTTG TCACACGGGC TGGCAGTGAG CGCTTTTCGC GCCACCAAGC CGCAACCGGT ACGCGCCGAT CTACCATCAC CCCAGATCGG GATTGTCCTT ATGATCTGGC CGCAGTATCC GGCCTCTGAT CATCCTGCTG ATCTTAAAGC TGCTCAGCGC ATCGACGGAG CAATGAACCG GCTCTTCCTC GAACCGCTGT TCCGCCGGCG CTACCCCGCC GATCTAGTAG CACACTTTGC TCGTCGGCTC ATCTTCGCGC CGGTCAAGCC CGGTGATATG GAGATTATCG GCCAGCCGAT CGATTTTCTC GGTATTAACA CCTACACGCG GCTCTTCAAT GCGGTGAACT GGCGCGAACC GTTTTTAATG ACTAAGCAGG TGCCGGGGCC GCTCCCCAAA ACGGCGATGG GCTGGGAGAT ATACCCCGAT TGTATCGTCG AGGCGTTGCA GAAGGCACGT GAGTATACGT CGCTTCCGCT GTACATCACC GAAAACGGCG CAGCGTTCGA CGATCCGCCA CCCGGCCCCA ACGATCAGAT CGTTGAAGAC CCGGATCGTG TTGCTTACCT CCGCTCTCAC ATTGCGGCTT GCCATCGCGC ACTGACCGCC GGGATCGATC TACGCGGTTA TTTTGTCTGG ACACTGATGG ACAATTTTGA GTGGGCGAAA GGGCTGAGCA AGCGGTTTGG GATTATTTAT ACCGATTATG CCACTCAACG CCGGGTGTGG AAACGCAGCG CGCACGTGTA CCGCGACATT ATCGCGCGCA ATGGGTTGTG A
|
Protein sequence | MNHAIRFPTN FIWGAATAAY QIEGAWNEDG KGESIWDRFV RRPGAIADGS TGDVACDHYH RYEEDLEHMA AMGLKAYRFS IAWPRIFPDG TGQPNQRGLD FYRRLIDGLH RRRILPVATL YHWDLPQAIE DRGGWINRDT AFYFAEYADY LFRQIGGDVA LWATHNEPFI QAFYGYGNGE NAPGKRVPWR VLHVVHHLLL SHGLAVSAFR ATKPQPVRAD LPSPQIGIVL MIWPQYPASD HPADLKAAQR IDGAMNRLFL EPLFRRRYPA DLVAHFARRL IFAPVKPGDM EIIGQPIDFL GINTYTRLFN AVNWREPFLM TKQVPGPLPK TAMGWEIYPD CIVEALQKAR EYTSLPLYIT ENGAAFDDPP PGPNDQIVED PDRVAYLRSH IAACHRALTA GIDLRGYFVW TLMDNFEWAK GLSKRFGIIY TDYATQRRVW KRSAHVYRDI IARNGL
|
| |