Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1472 |
Symbol | |
ID | 7310241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1787108 |
End bp | 1788382 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643608398 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_002505806 |
Protein GI | 220928897 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.120948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAGAA GATTAAAAGT ATCATTGACG GGGATACTCA CCGTGATTAT TGGGGTTCTG CTATCGGTAC CTAACATCAT GAATTATTTC AAAACCCTGT TAAGAGGCTT TGATGCTAAA TTCGGTCTGC TTAATGCAGT GCTGACGGCA GTTGGGCTTA CATTGTTTCT TTTTGGTTTA TTTATAATAT CAGCCGGATT AAGGAAGCTA TCCTTCTGGC TGACCATCCT TATTGTTTTT GTATTTATTT TCACTGTTGG GGCCGTTATT ACTTTTAGAA ATACGGTTTC CACGGATGTA TCAGAGACAG TAACGGAGGA AATCAAAATA AAGGCTGATT CCGAAGGTGC AAAGATGATT GATATACCTA TGGGTTCAGA TACTAAGACT ATTGCAGGCA TACTCACAAA TGAGGGCATT ATCAACAAAC CGCAGATTTT CACAGTTGTA TCAAAAATAA ACGGTTTTGA TGGAAAGTAT CAGGCTGGCA CACATATTTT GAAGCCGGGT CTGGAATTCA ATTCTATTAT GACAATTCTT ACAGGGAAGC CTGAAAGCAA AAAGGTTACA ATACCTGAGG GCTTGAGCTA CAGACAGATT GTCAATACGT TTGTTAAAAA AGAACTTGCA ACCACAGACA AGTTTGATTA TGCAATGAAG TATGAAAAAT ACGATTACGA TTTTGTGAAA AACATGAAAA GTAGTAACAA TCGTGAATTT CAGCTAGAAG GATATTTATT TCCCGATACA TACGAATTTG CCATGAATGC CAGTGAAAAG ACAATAGTAA GTATAATGCT TGAAAACTTT AATAACAAGA TAACAAAAGA GCATTATAAA CGTGCCAAGG AATTAGGTAT GTCGATGGAC GAAATTATTA CTCTTGCTTC CATTATTGAA AGAGAGGCAA ATAATACTAA GGACAGAAGG CTGGTATCGG CAGTATTCCA TAGACGTTTA AAAAGCAGGG ATTTGAATAG GTTGCAGTCC TGTGCTACCA TACAGTATGT TTTTCTAAAT AAAGAAGGAA AAGTGCATGA AAAGCTTACT TACGAGGATA CTAAAATTAT AAGTCCATAT AATACGTATA TTCATCCGGG TCTTCCACCG GGACCAATCT GTTCACCGGG CATGGATTCC ATAAACGCAG CATTATACCC CGATGAAGAT ACAGACTACA TGTTCTTTAT CGCAGGGCCG GAAGGTTCTA CTAAGTTCTC CAAGACATAT CAGGAGCATT TAAAGGCAAT GAAGCAATAT GGATTGGCAA AATAA
|
Protein sequence | MDRRLKVSLT GILTVIIGVL LSVPNIMNYF KTLLRGFDAK FGLLNAVLTA VGLTLFLFGL FIISAGLRKL SFWLTILIVF VFIFTVGAVI TFRNTVSTDV SETVTEEIKI KADSEGAKMI DIPMGSDTKT IAGILTNEGI INKPQIFTVV SKINGFDGKY QAGTHILKPG LEFNSIMTIL TGKPESKKVT IPEGLSYRQI VNTFVKKELA TTDKFDYAMK YEKYDYDFVK NMKSSNNREF QLEGYLFPDT YEFAMNASEK TIVSIMLENF NNKITKEHYK RAKELGMSMD EIITLASIIE REANNTKDRR LVSAVFHRRL KSRDLNRLQS CATIQYVFLN KEGKVHEKLT YEDTKIISPY NTYIHPGLPP GPICSPGMDS INAALYPDED TDYMFFIAGP EGSTKFSKTY QEHLKAMKQY GLAK
|
| |