Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2581 |
Symbol | |
ID | 4809188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3054052 |
End bp | 3055233 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107995 |
Product | dihydropteroate synthase |
Protein accession | YP_001038974 |
Protein GI | 125975064 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR01496] dihydropteroate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAACG CAAGAATTGT CTACATAAAT GATATGCATG AAGCAAAAGA AGAAATCCGC AAAATCGGTG TGGATGCGTC AGCCATAACA TGGCTTTCAC CCAAAGCATT GTCCATTGCA ATAAAGCTTG AGAATGTAAG CTCTTATGAG GCAAACATAC TCAAGCAGGA AATGCTGGCC AAGGGCGGAG ATGCCGCCGT AAACAGAGGT GTGGCAAATT TCAGCACGGA AAGCTCCGAT GTTCTCCTGA TGGGCACATA CAGCCAGTTT AACAAACTTG TGTACAAGCT TCGCATGCAC GGCGGAAGTT TCAAGGAAAT TACCGATGAA ATCCAAAGGC TTCTGGAAGG TTTGGAGAAG GGAAAGCCGG AGTATTTTGA ATGCGGCAAG TATAGACTGC CCATAGGAGA AAAAACTTAT GTGATGGGAA TACTCAATGT TACACCGGAT TCTTTCTCCG ACGGTGGAAA ATATCTTGAT ATTGACAGTG CGGTAAAAAG AGCCAGGGAA ATGGTGGATG AAGGCGCTGA CATAATAGAT GTGGGAGGAG AGTCGACGAG GCCCGGGCAT CAGCCCGTTG ATGCCCTGGA GGAAATAAAC CGGGTGATAC CGGTTATAGA AAGGCTTTCA AAGGAGTTGA ATGTGCCCAT ATCGGTTGAC ACCAGCAAGG CTCAGGTTGC AGAAAAGGCT CTTTGTGCGG GTGCCTGCAT TGTAAATGAC GTTTGGGGCC TCCAGAGGGA CCCGGACATG GCGGAGGTTG TATCAAAGCA CGGTGCAGGA GTAATTATGA TGCACAACAG TGACACCAAA GAATACAAGG ACCTAATGGG TGACATTATA AGGTTTTTGA GAAAGAGTGT TGAAATAGCC GAAAAGGCCG GAATTACCAG GGAAAATATG GTTATTGACC CCGGAATAGG CTTTGGAAAG ACTTTGGAGC ACAACCTGGA AGTAATGAGA AGAATGAGGG AACTAAACAC CCTAAACCTT CCGGTTCTTC TTGGGACATC AAGAAAGTCC ATGATAGGAA ATGTTTTGGA TTTGCCTGTA AATGAAAGGC TTGAAGGGAC TGCCGCAACC ATTACCCTTG GTATTGCCAA CGGGGCGGAT ATAGTGAGAG TCCACGATGT AAAGGAAATG GTGCGGGTGG CAAGGATGAC CGATGCTATG GTAAGAGTTT GA
|
Protein sequence | MINARIVYIN DMHEAKEEIR KIGVDASAIT WLSPKALSIA IKLENVSSYE ANILKQEMLA KGGDAAVNRG VANFSTESSD VLLMGTYSQF NKLVYKLRMH GGSFKEITDE IQRLLEGLEK GKPEYFECGK YRLPIGEKTY VMGILNVTPD SFSDGGKYLD IDSAVKRARE MVDEGADIID VGGESTRPGH QPVDALEEIN RVIPVIERLS KELNVPISVD TSKAQVAEKA LCAGACIVND VWGLQRDPDM AEVVSKHGAG VIMMHNSDTK EYKDLMGDII RFLRKSVEIA EKAGITRENM VIDPGIGFGK TLEHNLEVMR RMRELNTLNL PVLLGTSRKS MIGNVLDLPV NERLEGTAAT ITLGIANGAD IVRVHDVKEM VRVARMTDAM VRV
|
| |