Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3701 |
Symbol | |
ID | 7268237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 4497874 |
End bp | 4499064 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643568508 |
Product | tryptophan synthase subunit beta |
Protein accession | YP_002464973 |
Protein GI | 219850540 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0133] Tryptophan synthase beta chain |
TIGRFAM ID | [TIGR00263] tryptophan synthase, beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00142022 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGGATG TTGTTTCACG ACCGGGACGA TTCGGTCCAT ATGGTGGTCG GTATGTACCT GAAACGCTGA TGCCGGCGGT TAGTGCGTTA GAGGAGGCGT ATGAAGCAGC CAAAGCCGAT CCATCGTTTT GGGAAGAATT AGCAGCCCTC CACCGCACCT ATACCGGTCG ACCAACACCG TTAACCTTTG CCGCCCGATT AACTGCCCAC TGCGGTGGCG CACGCATCTA TCTCAAGCGC GAAGATTTGG CCCATACCGG CGCACACAAG ATCAACAATG CGCTCGGGCA GGGCTTGTTG GCGAAACGAA TGGGCAAGCG GCGCGTGATT GCCGAAACCG GCGCCGGTCA GCATGGGGTA GCGACTGCCA CCGTCTGCGC GTTGCTCGGT CTGGAGTGCG TGGTCTATAT GGGGGTCGAT GATATGGCCC GCCAGCGTCC CAATGTCTTC CGTATGCGGT TGCTGGGGGC TGAAGTACGT GGGGTGAGCA GTGGTTCACG CACGTTAAAA GACGCGATCA ACGAAGCAAT GCGCGATTGG GTGACGAATC CGGACAGCTA TTACCTGCTT GGCTCGGCGC TGGGGCCGCA CCCCTACCCG ACCATGGTGC GCGACTTTCA GCGCGTCATC GGGATTGAAG CGCGCGAGCA AATCATCGCT GCCACCGGTC GGTTGCCCGA TATGGTAATT GCCTGTGTGG GCGGTGGCTC GAACGCCATC GGTATCTTTC ACCCGTTCCT CGACGATCCT GAGGTAGCGT TGCGTGGCGT TGAAGCCGGT GGACGCGGTG AACGACTCGG TGAACATGCC GCTCGCTTTC GTGCCGTGAC TCCCGGTGTG CTGCAGGGCA CCTTTTCGTA TGTGCTACAA GACGAGTTCG GGCAGATCGC GCTTACCCAT TCAGTCAGTG CCGGCTTGGA TTATGCCAGC ATCGGTCCCG AACACGCATG GCTCCACGAT ACTGGACGGG CGACCTACAC TGCTGCCGGT GATGACGAGG CGTTGGCCGC GTTCCAGTTA TTGGCCAAGC TCGAAGGGAT TATCCCAGCA TTAGAGAGTG CCCACGCGGT GGCCGAGGCG ATCAAGGTCG CCCCGACAAT GCGGCCTGAT CAGACCATTC TGGTGAACTT ATCGGGGCGA GGCGATAAAG ATATCTTTAC CGTCGCTGAT CTGTTAGGGG TCGAAATCTA G
|
Protein sequence | MQDVVSRPGR FGPYGGRYVP ETLMPAVSAL EEAYEAAKAD PSFWEELAAL HRTYTGRPTP LTFAARLTAH CGGARIYLKR EDLAHTGAHK INNALGQGLL AKRMGKRRVI AETGAGQHGV ATATVCALLG LECVVYMGVD DMARQRPNVF RMRLLGAEVR GVSSGSRTLK DAINEAMRDW VTNPDSYYLL GSALGPHPYP TMVRDFQRVI GIEAREQIIA ATGRLPDMVI ACVGGGSNAI GIFHPFLDDP EVALRGVEAG GRGERLGEHA ARFRAVTPGV LQGTFSYVLQ DEFGQIALTH SVSAGLDYAS IGPEHAWLHD TGRATYTAAG DDEALAAFQL LAKLEGIIPA LESAHAVAEA IKVAPTMRPD QTILVNLSGR GDKDIFTVAD LLGVEI
|
| |