Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2120 |
Symbol | |
ID | 7267628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 2610513 |
End bp | 2612024 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643566954 |
Product | anthranilate synthase component I |
Protein accession | YP_002463442 |
Protein GI | 219849009 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.016796 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATTATC CAACCCTTGA CCAGATGTAT GAATTACGCC GGCAGGGCAA TCTCTGCCCG ATCTACCGTG AGATTATGGC CGACCTTGAA ACGCCGGTCT CGGCCTATCT CAAGATCGCG CAGAACGGTC TCGGCTTTCT GCTCGAAAGT GTGACCGGAG GGCAGAATAT CGGTCGTTAT TCATTTATCG GGAGCGATCC GTATATGGTT CTGCGCATGC ACGACGGTGT GGCGCAAGCG ACACATGGGG GCTACAAGCA GACTCTTTCC TACAGCGATC CGTTGATCGT GCTTGAGAGC TATCTTAACG CCTATCGCCC GATCCGTCTG CCCAATTTGC CGATCTTTGT TGGTGGTGCG GTTGGCTATC TCAGCTACGA AGCGGCCCGC TATTTTGAAC GCCTGCCGGT GCCGTCGGTG CGTCCCTACG ATATGCCTGA TAGCTGGTGG ATGTTTGTTG ATACGCTTCT CGCCTTCGAT CACGTCCGCC ACAAGATGAT CGTTATCTCG CACGTCCATC TCGATGTCGA GGATTTAGCG GCTGAATATC AGCGAGCCGT TACCCGCATC GAAACACTGA TAGCCCGGCT GCAAAAGCCA TTGCCTCCGA CTATCGGCTT TAGCCTGCGC AATTACGAAC CGCACAGTAC GCCGGCGCGT ACCGTGCCCA ATCCGGTCGT CTCGAACCGT ACTGAAGCCG AGTTCAAGGC GGCAGTGTTG CGGGCGAAAG AGTATATTAT GGCCGGTGAC ATCTTTCAGG TACAGATTTC GCAGCGCTTC AGCAAGGCAA CGAGCGCCGA CAGCTTCACC ATCTATCGCG CGCTCCGCAC CATCAACCCT TCGCCGTATA TGTTCTACAT CCGCACCGGC GAAGGCGATT TGGTCGGCGC CTCACCTGAA ATGCTGGTAC AGGTGCGCGA TGGTAACGTC ACGACCCGCC CGATTGCAGG GACGCGCTGG CGTGGACGCG ATGCCGCCGA AGATGAACGA CTGGCCGCCG AGTTGCTGGC CGACGAAAAG GAACGCGCCG AGCATTTGAT GTTGGTCGAT TTGGGGCGTA ACGATATAGG GCGGATCAGC GAACCGGGAA CGGTACACGT GCCGGTCTTT ATGACCATCG AGAAATACAG TCACGTCCAG CATATTGTGT CAGAAGTGGT CGGTAAACTC CGGGCCGATC TCAAGTCGAT CGATGCCTTA CGGGCCTGTT TTCCTGCCGG TACCGTCACC GGTGCGCCCA AGATTCGCTC GATGGAGATT ATCGCCGAAC TCGAAGGTGA GCAGCGTGGT ATCTATGCCG GCGCAGTTGG GCATCTTGGC TTCAACGGTG ATCTCGACAC ATGCATTGCG CTGCGTACCC TGATCGTCAA AGATGGGGTT GCCTATGCCC AAGCTGCTGC CGGGGTAGTT GCCGATAGCA CGCCCGAATA CGAGTTTAAT GAGAGTTGTA ATAAAGCAGC AGCCTCGTTA CGCGCGATTG ATCTGGCTGA AGAATTGCAG GCAGGATTGT GA
|
Protein sequence | MYYPTLDQMY ELRRQGNLCP IYREIMADLE TPVSAYLKIA QNGLGFLLES VTGGQNIGRY SFIGSDPYMV LRMHDGVAQA THGGYKQTLS YSDPLIVLES YLNAYRPIRL PNLPIFVGGA VGYLSYEAAR YFERLPVPSV RPYDMPDSWW MFVDTLLAFD HVRHKMIVIS HVHLDVEDLA AEYQRAVTRI ETLIARLQKP LPPTIGFSLR NYEPHSTPAR TVPNPVVSNR TEAEFKAAVL RAKEYIMAGD IFQVQISQRF SKATSADSFT IYRALRTINP SPYMFYIRTG EGDLVGASPE MLVQVRDGNV TTRPIAGTRW RGRDAAEDER LAAELLADEK ERAEHLMLVD LGRNDIGRIS EPGTVHVPVF MTIEKYSHVQ HIVSEVVGKL RADLKSIDAL RACFPAGTVT GAPKIRSMEI IAELEGEQRG IYAGAVGHLG FNGDLDTCIA LRTLIVKDGV AYAQAAAGVV ADSTPEYEFN ESCNKAAASL RAIDLAEELQ AGL
|
| |