Gene Information |
Locus tag | Ccel_1991 |
Symbol | |
ID | 7310702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2357340 |
End bp | 2358650 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643608926 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002506319 |
Protein GI | 220929410 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTACA AAACACAGAT GGACGCCGCA AAAAAAGGTA TTATTACAAA TGAAATGAAG ATTGTTTCTC GAAAGGAATC AATGGATGAA AATAAGCTTC GGGAGCTTGT TGCTGAGGGC AGAATAGCTA TTCCGGCAAA TATAAATCAC AAGTCCTTAA GTCCGGAGGG AATAGGAGAA GGTCTCAGAA CAAAAATCAA TGTAAACCTT GGTATATCAG GGGATTGTCC TGACTATACA AAAGAAATGG AAAAGGCTGA TATGTCAATC AAATTTGGTG TTGAAGCCAT AATGGACCTT AGTAATTACG GAAAGACAAA CACCTTTCGC AAAGAGCTTA TAAAGCGTTC TCCTGCCATG ATAGGAACCG TTCCCATGTA TGATGCCATA GGCTACCTTG AAAAGGACCT GATGGATATT AAAGCTTCGG ACTTTTTAAA AGTTGTTGAG GCTCATGCTG CAGAAGGTGT GGACTTTATG ACAATTCATG CAGGAATAAA TAAACGTGCC GTTGAGTGTT TCAAACGCTC AAAGAGACTT ACCAATATTG TTTCAAGAGG AGGTTCCCTT CTTTTTGCAT GGATGGAGAT GACTGGCAAT GAAAATCCTT TTTTTGAGTA TTATGACGAT TTCCTTGGAA TACTCAGGGA ATATGATGTA ACCATAAGTC TTGGAGATGC ATTGAGGCCC GGTAGTATCA ATGACAGCTC AGACGCCGGA CAATTAAGTG AACTTATAGA ACTGGGCGAC CTGACCAAAC GTGCATGGGA AAAGGATGTA CAGGTAATGG TTGAGGGGCC GGGACATATG GCTATGAACG AGATAGCCGC CAATATGACT ATTCAAAAGA GACTATGTCA TGGAGCACCT TTTTATGTAC TGGGGCCTTT GGTGACAGAT ATAGCACCGG GATATGACCA TATCACCTCG GCAATAGGCG GAGCAATTGC TGCGGCAAAC GGTGCAGATT TCTTATGTTA TGTAACCCCT GCGGAGCATT TAAGGTTACC TGACCTTTCG GATGTAAAAG AAGGAATAGT TGCTTCAAAA ATAGCCGCTC ATGCCGCCGA TATTGCAAAG GGAATACCTA ATGCACGTGA AATAGATAAT AAAATGAGCG ATGCCAGACG CCGAATCGAC TGGGAAGAAA TGTTCTCATA TGCTATTGAC GAAGATAAGG CAAGAGCATA TTTTGAAAGC ACACCTCCCA CTGACAGACA TACCTGCTCA ATGTGCGGAA AAATGTGTGC TATGAGGACT ACAAATAAGA TTTTAGCCGG TGAAAAGGTT GAGTTTGTAA CAGAGAAATA G
|
Protein sequence | MNYKTQMDAA KKGIITNEMK IVSRKESMDE NKLRELVAEG RIAIPANINH KSLSPEGIGE GLRTKINVNL GISGDCPDYT KEMEKADMSI KFGVEAIMDL SNYGKTNTFR KELIKRSPAM IGTVPMYDAI GYLEKDLMDI KASDFLKVVE AHAAEGVDFM TIHAGINKRA VECFKRSKRL TNIVSRGGSL LFAWMEMTGN ENPFFEYYDD FLGILREYDV TISLGDALRP GSINDSSDAG QLSELIELGD LTKRAWEKDV QVMVEGPGHM AMNEIAANMT IQKRLCHGAP FYVLGPLVTD IAPGYDHITS AIGGAIAAAN GADFLCYVTP AEHLRLPDLS DVKEGIVASK IAAHAADIAK GIPNAREIDN KMSDARRRID WEEMFSYAID EDKARAYFES TPPTDRHTCS MCGKMCAMRT TNKILAGEKV EFVTEK
|
| |
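The length fields in this record can be cross-checked against the coordinates and the sequence. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TAG). The helper names `gene_length`, `protein_length` and `gc_fraction` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring the spaces that group the sequence."""
    bases = seq.replace(" ", "").upper()
    return sum(base in "GC" for base in bases) / len(bases)


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(2357340, 2358650)
print(length, protein_length(length))  # prints: 1311 436
```

Both results agree with the Gene Length (1311 bp) and Protein Length (436 aa) fields. Passing the full gene sequence string to `gc_fraction` gives a value to compare with the GC content field.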