Gene Information |
Locus tag | Cthe_1008 |
Symbol | |
ID | 4811302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1205397 |
End bp | 1206518 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106426 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_001037433 |
Protein GI | 125973523 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
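The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1205397
end_bp = 1206518

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute a residue to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1122 bp) and Protein Length (373 aa) rows.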
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00821655 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGAA ACGGTGCGGC AGTATCTAAA AACAAAACAA AAAAGCGAAA AAACAGACTC GCGTCTTTAT TTTTGTACTT TCTTGTTTTC CTTATTATAT TTACAGTCAG CACCTTGGCT TCTTATACAT ACTTCATAAA TGAGAAAGAG ATCAATTATG AAGAAGTTAT GGCAAAAATA GACCCCGAAA ACGGTATTCA GGTTGAAATC CCCCGGGGAG CCAATACGGA CGACATTGCA AACATCCTCA GGGAGCACGG AGTAATAAAA TATCCTTTTT GGTTTAAGTT TGTTTCCAAA TTCAACGGCT ATGACGGCCG TTACAAATCA GGAAAACATA TTGTGAACAA AGACCTTAAA TATAAAGAAA TTATGGAAAT ACTCTGCAGC AACCCTGTTA CAACCACCGT TACCATAATC GAGGGTAAAA ATACGGATCA AATTGCTGAT ATTCTAAGCG AAAAAAAGGT TATCGACAAG GAGGCCTTTC TTGAAGCCTG CAATACCGAA AAATTTGATT ATGAATTTTT AAAAGACATT CCGGAAAATC CTCAAAGAGA AAACAAACTT GAGGGATACC TTTTCCCGGA TACCTATTTC TTTGACCCAA AAGCAGGAGA ACGGGCAATA ATTGAAAAGT TTTTAGATAA CTTTGATGCA AAATTCAAGC CGGAATTTTA TGAAAGAGCC AAAGAGCTGA ACATGACGGT GGACGAGGTA ATTATTCTTG CCTCCATAAT AGAAAGGGAA ACAGCTCTCC CCGAAGAAAG GCCAATTGTT TCCAGCGTAT TTCACAACAG GCTGAAGTCT TCGGACCCCA ATCTAAAAAA GCTTGAATCC TGTGCCACCG TACAATATGT TTTGTACAAA ACTCAGGGAA AAATGAAGGA AAAGTTGTCC GACGAGGATA CAAAAATAGA CCACCCGTAC AACACATACC TTTATGAGGG GCTTCCGCCG GGACCCATAT GCTGTCCTGG TCTTGCCTCC ATAGAAGCTG CATTGTATCC GGACGAAGAA TCAGAGTATC TGTACTTTGT GGCAAAAGGA GACGGAAGTC ACGAATTTTC AAGAACTTTG GCCGAACATT TGGAAGCTGT TAAAAAGTAT CAATCCAATT GA
Protein sequence | MSGNGAAVSK NKTKKRKNRL ASLFLYFLVF LIIFTVSTLA SYTYFINEKE INYEEVMAKI DPENGIQVEI PRGANTDDIA NILREHGVIK YPFWFKFVSK FNGYDGRYKS GKHIVNKDLK YKEIMEILCS NPVTTTVTII EGKNTDQIAD ILSEKKVIDK EAFLEACNTE KFDYEFLKDI PENPQRENKL EGYLFPDTYF FDPKAGERAI IEKFLDNFDA KFKPEFYERA KELNMTVDEV IILASIIERE TALPEERPIV SSVFHNRLKS SDPNLKKLES CATVQYVLYK TQGKMKEKLS DEDTKIDHPY NTYLYEGLPP GPICCPGLAS IEAALYPDEE SEYLYFVAKG DGSHEFSRTL AEHLEAVKKY QSN
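The relationship between the two sequence rows can be spot-checked by translating the gene sequence. The sketch below is illustrative only: the `translate` helper and the constant names are hypothetical, and it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may serve as start codons, which does not matter here because the gene begins with ATG). Only the first three and the last two displayed blocks of the gene sequence are used.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is removed first, because the record displays the
    sequence in blocks of ten bases.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence row (both in frame).
first_residues = translate("ATGAGCGGAA ACGGTGCGGC AGTATCTAAA")
last_residues = translate("CAATCCAATT GA")

print(first_residues)
print(last_residues)
```

The two fragments translate to the first ten residues (MSGNGAAVSK) and the last three residues (QSN) of the protein sequence row, with TGA as the terminating stop codon.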