Gene Cthe_0875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0875 
Symbol 
ID4810493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1048404 
End bp1049888 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content44% 
IMG OID640106291 
Productanthranilate synthase, component I 
Protein accessionYP_001037302 
Protein GI125973392 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000225677 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTATC CAACCCTGGA CGAAGTCAAA ATAATGGCAA AAGATTATAA TATCATACCT 
GTCACAATGG AAGTATATGC CGACATGGAA ACCCCTATAA GCCTTTTTAA AAGGTTTGAG
GAAAGCAGTT GCTGTTTCCT TTTGGAGAGC GTTGAGGGCG GTGAAAAATG GGCCCGGTAC
TCCATCATCG GAAAAAATCC GTTTCTTGTT GTGGAAAGCT ACAAAAACAA AACCATTATA
AGGGAGAGGA ACGGTTCTCA AAGGGAAGTT GAAGGAAATC CTGTTGAAAT AATAAAGGGC
ATTATGGGGA AGTTTAAAGG TGCCAACCTT CCGAATCTTC CGAGATTCAA CGGGGGAGCG
GTGGGATATT TTGGGTATGA CCTCATACGA CACTATGAAA ATCTTCCCAA TGTCCCCGAA
GATGACATGG GTCTTCCGGA ATGCCATTTC ATGTTTACCG ACGAAGTGCT GGTGTATGAC
CATCTAAAGC AGAAAATTCA TATAATTGTT AATTTGCATG TCAACGGCAA CATTGAACGG
GCCTATATAA GCGCGGTTGA CCGGATAAAA ACCATACACA GGGAGATTCT TGACACCAGG
TGGAAAACCG CTGACAACTC TGTTCTAAGT TACAATAAAA AGAAAAATGA ACTTGCGGTA
ACCAGCAATA TTTCAAAAGA GGATTTCTGT CGGAATGTGT TGAAGGCAAA GCAGTATATA
AGGGACGGAG ACATATTCCA GGTGGTTTTG TCGCAACGCT TGTGTGTTGA GACAAATGAA
AATCCTTTTA ACATATACCG CGCCTTAAGG GTTATAAATC CTTCTCCATA TATGTATTAT
CTTAAATTTG GCGGCTACAG AATAATAGGT TCTTCCCCCG AGATGCTGGT CAGGGTTGAA
AATGGAATTG TGGAAACCTG TCCGATTGCA GGAACGCGAA AGAGAGGCAG GACAAAAGAA
GAGGATGAGG CTTTGGAAAA AGAGCTTCTT TCCGATGAGA AAGAAATAGC CGAGCATGTG
ATGCTGGTGG ACCTGGGCAG AAACGATATC GGAAGAGTAT CGAAATTTGG TACCGTAGCG
GTAAAGAACC TTATGCACAT TGAGAGATAT TCCCATGTAA TGCATGTGGT AACAAACGTA
CAGGGAGAGA TTCGGGAGGA TAAGACTCCT TTTGACGCCC TTATGTCCAT TCTTCCTGCC
GGTACCCTTT CCGGAGCGCC AAAGGTCAGG GCTATGGAGA TAATAGACGA GCTTGAGACC
GTAAAAAGAG GTCCCTACGG CGGTGCGATC GGGTATCTTA GCTTTAACGG CAATCTCGAC
AGCTGCATAA CCATAAGGAC AATTATATTA AAGGACGGAA AGGCTTATGT TCAGGCCGGA
GCGGGCATAG TCGCGGATTC GGTCCCGGAA AGGGAGTATG AAGAGTGCTA CAACAAAGCA
ATGGCACTTC TTAAAGCCAT AGAAGAGGCA GGTGAAATAA GATGA
 
Protein sequence
MFYPTLDEVK IMAKDYNIIP VTMEVYADME TPISLFKRFE ESSCCFLLES VEGGEKWARY 
SIIGKNPFLV VESYKNKTII RERNGSQREV EGNPVEIIKG IMGKFKGANL PNLPRFNGGA
VGYFGYDLIR HYENLPNVPE DDMGLPECHF MFTDEVLVYD HLKQKIHIIV NLHVNGNIER
AYISAVDRIK TIHREILDTR WKTADNSVLS YNKKKNELAV TSNISKEDFC RNVLKAKQYI
RDGDIFQVVL SQRLCVETNE NPFNIYRALR VINPSPYMYY LKFGGYRIIG SSPEMLVRVE
NGIVETCPIA GTRKRGRTKE EDEALEKELL SDEKEIAEHV MLVDLGRNDI GRVSKFGTVA
VKNLMHIERY SHVMHVVTNV QGEIREDKTP FDALMSILPA GTLSGAPKVR AMEIIDELET
VKRGPYGGAI GYLSFNGNLD SCITIRTIIL KDGKAYVQAG AGIVADSVPE REYEECYNKA
MALLKAIEEA GEIR