Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_3066 |
Symbol | |
ID | 7312444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3609595 |
End bp | 3610830 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643609968 |
Product | hypothetical protein |
Protein accession | YP_002507338 |
Protein GI | 220930429 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAACA CTCAAAGGGC CCACGCAATA CTCTCGGCTT CAGGTTCAAA ACGATGGCTT AGTTGCCCAC CAAGTGCAAA GCTCGAAGAA CAATTCCCGG AAAGTACAAG CGAATTCGCA GAAGAAGGTA CATATGCCCA CAGCTTCGCA GAATTAAAGC TCCGAGGATA TATAACTACG GACCTCAAGC CAAGTGTCTA TAAAAAGAAA TTAGCAGAGA TTAAGAAAGA TCCTTTTTAT AGTCAAAGCT TAGATGATTA CATAGAACAA TATATCAATA TTGTGGGTGA GAAATACCTT GCTGCAAAAA AGAATAGTTC AGATTCTTTT GTAATGCTCG AGCAGAAACT TGATTTTTCA GAATGGGTTC CTGATGGTTT TGGAACTGGG GACGTAGTTT TGATATCTCC AGGGATTCTT GAAATTGTGG ACCTGAAATA TGGACAAGGT GTCCCTGTAT CCGCCGAAGG AAATACCCAA ATGCGATTAT ATGCCCTTGG TGCACTTAAT CAATATGGTA TGTTGTATGA CTTCGATAAA ATCAAAATGA CTATTATTCA GCCTAGACTT GACAGTATAT CCGAGGATGA AATAACGGTT CAGGAATTGC TTGACTGGGG AGAGTCAGTT GTTAAGCCTA CTGCGGATAT GGCAATTGCC GGTGAAGGAG AATTCAAGTC GGGAGATCAC TGTCAGTTTT GTAGAGCTAA AGCAGTATGC AGAAAAAGGG CTGAGGATAA TCTTGAAATG GCTAGATACG AATTCGAAGA TCCTAATATC TTATCAAATG ATGAGATAGC AGATATTCTA GCTAAAGCAG CAGAGCTCCA AAAATGGGCA TCAGATGTAC AAGCCTATGC ACTCGATCAA GCAGAGAATC ATGGGGTTAA ATTTACTGGT TGGAAGCTGG TCGAGGGTAG AAGTAATAGA AAATATACGG ATGAAGATGC TGTGGCTACA AAGCTGAAAG ACGAGGGTTA CGCATCGGAT GTTATATACC AGCCACAAAA AATCTGGGGT ATTAGCGAAA TGGAGAAAAA GATTGGTAAA AGGCTTTTCG CTGACTATCT TACTGAATTT GTTGTTAAAC CAGCAGGTAA AGCAACTCTT GTTCCAGAGA GTGATAAACG CCCGGAGATA TCATCCGTAG CATCAGCAGT AAGAGATTTT GATGACCTTT ATGAAAACAA GCTTCAACAT GAAACAGAAA AAATTCCAGA CGATATTTTA AATTGA
|
Protein sequence | MGNTQRAHAI LSASGSKRWL SCPPSAKLEE QFPESTSEFA EEGTYAHSFA ELKLRGYITT DLKPSVYKKK LAEIKKDPFY SQSLDDYIEQ YINIVGEKYL AAKKNSSDSF VMLEQKLDFS EWVPDGFGTG DVVLISPGIL EIVDLKYGQG VPVSAEGNTQ MRLYALGALN QYGMLYDFDK IKMTIIQPRL DSISEDEITV QELLDWGESV VKPTADMAIA GEGEFKSGDH CQFCRAKAVC RKRAEDNLEM ARYEFEDPNI LSNDEIADIL AKAAELQKWA SDVQAYALDQ AENHGVKFTG WKLVEGRSNR KYTDEDAVAT KLKDEGYASD VIYQPQKIWG ISEMEKKIGK RLFADYLTEF VVKPAGKATL VPESDKRPEI SSVASAVRDF DDLYENKLQH ETEKIPDDIL N
|
| |