Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0875 |
Symbol | |
ID | 4810493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1048404 |
End bp | 1049888 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106291 |
Product | anthranilate synthase, component I |
Protein accession | YP_001037302 |
Protein GI | 125973392 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000225677 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTATC CAACCCTGGA CGAAGTCAAA ATAATGGCAA AAGATTATAA TATCATACCT GTCACAATGG AAGTATATGC CGACATGGAA ACCCCTATAA GCCTTTTTAA AAGGTTTGAG GAAAGCAGTT GCTGTTTCCT TTTGGAGAGC GTTGAGGGCG GTGAAAAATG GGCCCGGTAC TCCATCATCG GAAAAAATCC GTTTCTTGTT GTGGAAAGCT ACAAAAACAA AACCATTATA AGGGAGAGGA ACGGTTCTCA AAGGGAAGTT GAAGGAAATC CTGTTGAAAT AATAAAGGGC ATTATGGGGA AGTTTAAAGG TGCCAACCTT CCGAATCTTC CGAGATTCAA CGGGGGAGCG GTGGGATATT TTGGGTATGA CCTCATACGA CACTATGAAA ATCTTCCCAA TGTCCCCGAA GATGACATGG GTCTTCCGGA ATGCCATTTC ATGTTTACCG ACGAAGTGCT GGTGTATGAC CATCTAAAGC AGAAAATTCA TATAATTGTT AATTTGCATG TCAACGGCAA CATTGAACGG GCCTATATAA GCGCGGTTGA CCGGATAAAA ACCATACACA GGGAGATTCT TGACACCAGG TGGAAAACCG CTGACAACTC TGTTCTAAGT TACAATAAAA AGAAAAATGA ACTTGCGGTA ACCAGCAATA TTTCAAAAGA GGATTTCTGT CGGAATGTGT TGAAGGCAAA GCAGTATATA AGGGACGGAG ACATATTCCA GGTGGTTTTG TCGCAACGCT TGTGTGTTGA GACAAATGAA AATCCTTTTA ACATATACCG CGCCTTAAGG GTTATAAATC CTTCTCCATA TATGTATTAT CTTAAATTTG GCGGCTACAG AATAATAGGT TCTTCCCCCG AGATGCTGGT CAGGGTTGAA AATGGAATTG TGGAAACCTG TCCGATTGCA GGAACGCGAA AGAGAGGCAG GACAAAAGAA GAGGATGAGG CTTTGGAAAA AGAGCTTCTT TCCGATGAGA AAGAAATAGC CGAGCATGTG ATGCTGGTGG ACCTGGGCAG AAACGATATC GGAAGAGTAT CGAAATTTGG TACCGTAGCG GTAAAGAACC TTATGCACAT TGAGAGATAT TCCCATGTAA TGCATGTGGT AACAAACGTA CAGGGAGAGA TTCGGGAGGA TAAGACTCCT TTTGACGCCC TTATGTCCAT TCTTCCTGCC GGTACCCTTT CCGGAGCGCC AAAGGTCAGG GCTATGGAGA TAATAGACGA GCTTGAGACC GTAAAAAGAG GTCCCTACGG CGGTGCGATC GGGTATCTTA GCTTTAACGG CAATCTCGAC AGCTGCATAA CCATAAGGAC AATTATATTA AAGGACGGAA AGGCTTATGT TCAGGCCGGA GCGGGCATAG TCGCGGATTC GGTCCCGGAA AGGGAGTATG AAGAGTGCTA CAACAAAGCA ATGGCACTTC TTAAAGCCAT AGAAGAGGCA GGTGAAATAA GATGA
|
Protein sequence | MFYPTLDEVK IMAKDYNIIP VTMEVYADME TPISLFKRFE ESSCCFLLES VEGGEKWARY SIIGKNPFLV VESYKNKTII RERNGSQREV EGNPVEIIKG IMGKFKGANL PNLPRFNGGA VGYFGYDLIR HYENLPNVPE DDMGLPECHF MFTDEVLVYD HLKQKIHIIV NLHVNGNIER AYISAVDRIK TIHREILDTR WKTADNSVLS YNKKKNELAV TSNISKEDFC RNVLKAKQYI RDGDIFQVVL SQRLCVETNE NPFNIYRALR VINPSPYMYY LKFGGYRIIG SSPEMLVRVE NGIVETCPIA GTRKRGRTKE EDEALEKELL SDEKEIAEHV MLVDLGRNDI GRVSKFGTVA VKNLMHIERY SHVMHVVTNV QGEIREDKTP FDALMSILPA GTLSGAPKVR AMEIIDELET VKRGPYGGAI GYLSFNGNLD SCITIRTIIL KDGKAYVQAG AGIVADSVPE REYEECYNKA MALLKAIEEA GEIR
|
| |