Gene Cagg_3701 details

Gene Information

Locus tag: Cagg_3701
Symbol:
ID: 7268237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4497874
End bp: 4499064
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 59%
IMG OID: 643568508
Product: tryptophan synthase subunit beta
Protein accession: YP_002464973
Protein GI: 219850540
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit
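
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that encodes no residue. The function names are hypothetical and are not part of the source database.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
length = cds_length(4497874, 4499064)
print(length)                           # compare with "Gene Length"
print(expected_protein_length(length))  # compare with "Protein Length"
```

Under these assumptions the two printed values can be compared directly with the Gene Length and Protein Length fields.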


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00142022
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGGATG TTGTTTCACG ACCGGGACGA TTCGGTCCAT ATGGTGGTCG GTATGTACCT 
GAAACGCTGA TGCCGGCGGT TAGTGCGTTA GAGGAGGCGT ATGAAGCAGC CAAAGCCGAT
CCATCGTTTT GGGAAGAATT AGCAGCCCTC CACCGCACCT ATACCGGTCG ACCAACACCG
TTAACCTTTG CCGCCCGATT AACTGCCCAC TGCGGTGGCG CACGCATCTA TCTCAAGCGC
GAAGATTTGG CCCATACCGG CGCACACAAG ATCAACAATG CGCTCGGGCA GGGCTTGTTG
GCGAAACGAA TGGGCAAGCG GCGCGTGATT GCCGAAACCG GCGCCGGTCA GCATGGGGTA
GCGACTGCCA CCGTCTGCGC GTTGCTCGGT CTGGAGTGCG TGGTCTATAT GGGGGTCGAT
GATATGGCCC GCCAGCGTCC CAATGTCTTC CGTATGCGGT TGCTGGGGGC TGAAGTACGT
GGGGTGAGCA GTGGTTCACG CACGTTAAAA GACGCGATCA ACGAAGCAAT GCGCGATTGG
GTGACGAATC CGGACAGCTA TTACCTGCTT GGCTCGGCGC TGGGGCCGCA CCCCTACCCG
ACCATGGTGC GCGACTTTCA GCGCGTCATC GGGATTGAAG CGCGCGAGCA AATCATCGCT
GCCACCGGTC GGTTGCCCGA TATGGTAATT GCCTGTGTGG GCGGTGGCTC GAACGCCATC
GGTATCTTTC ACCCGTTCCT CGACGATCCT GAGGTAGCGT TGCGTGGCGT TGAAGCCGGT
GGACGCGGTG AACGACTCGG TGAACATGCC GCTCGCTTTC GTGCCGTGAC TCCCGGTGTG
CTGCAGGGCA CCTTTTCGTA TGTGCTACAA GACGAGTTCG GGCAGATCGC GCTTACCCAT
TCAGTCAGTG CCGGCTTGGA TTATGCCAGC ATCGGTCCCG AACACGCATG GCTCCACGAT
ACTGGACGGG CGACCTACAC TGCTGCCGGT GATGACGAGG CGTTGGCCGC GTTCCAGTTA
TTGGCCAAGC TCGAAGGGAT TATCCCAGCA TTAGAGAGTG CCCACGCGGT GGCCGAGGCG
ATCAAGGTCG CCCCGACAAT GCGGCCTGAT CAGACCATTC TGGTGAACTT ATCGGGGCGA
GGCGATAAAG ATATCTTTAC CGTCGCTGAT CTGTTAGGGG TCGAAATCTA G
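
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs normalizing before any analysis. The following is a minimal sketch of how one might strip the layout whitespace and compute a GC fraction. The helper name `gc_fraction` is hypothetical, and this is not necessarily how the database derived its reported GC content.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring the spaces and newlines used for display."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Demonstration on the first two 10-base blocks of the gene sequence.
print(gc_fraction("ATGCAGGATG TTGTTTCACG"))
```

Passing the full gene sequence, pasted as one multi-line string, gives a value to compare with the GC content field of the record.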
 
Protein sequence
MQDVVSRPGR FGPYGGRYVP ETLMPAVSAL EEAYEAAKAD PSFWEELAAL HRTYTGRPTP 
LTFAARLTAH CGGARIYLKR EDLAHTGAHK INNALGQGLL AKRMGKRRVI AETGAGQHGV
ATATVCALLG LECVVYMGVD DMARQRPNVF RMRLLGAEVR GVSSGSRTLK DAINEAMRDW
VTNPDSYYLL GSALGPHPYP TMVRDFQRVI GIEAREQIIA ATGRLPDMVI ACVGGGSNAI
GIFHPFLDDP EVALRGVEAG GRGERLGEHA ARFRAVTPGV LQGTFSYVLQ DEFGQIALTH
SVSAGLDYAS IGPEHAWLHD TGRATYTAAG DDEALAAFQL LAKLEGIIPA LESAHAVAEA
IKVAPTMRPD QTILVNLSGR GDKDIFTVAD LLGVEI
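
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below builds a codon table in the usual TCAG ordering. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), that the sequence contains only A, C, G and T, and that the reading frame begins at the first base. The `translate` helper is hypothetical and written for illustration only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGCAGGATG TTGTTTCACG ACCGGGACGA"))
```

Translating the full gene sequence and comparing the result with the protein sequence above (after removing its display spaces) is one way to confirm that the two listings agree.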