Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1033 |
Symbol | |
ID | 7309855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1285554 |
End bp | 1286951 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643607960 |
Product | L-arabinose isomerase |
Protein accession | YP_002505375 |
Protein GI | 220928466 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2160] L-arabinose isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000049306 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAACCA AACAAAAACC AAGAATCGGA TTTTTGGGCC TAATGCAGGG ATTGTATGAC GAATCACAGC CGGAACTGCC GAAAATGCAG GAGGCATTTG CCAGAGAAGT GGTTGAACAA TTAAAAGATG TGGCAGATAT TGATTTTCCC GGTCCAGCAA AAGAAAGAGA AGATATAGAA AGATATGTAA AATATTTCAA TGATAAAGAG TACGATGGAA TAATGATAGT AAATCTGTTG TACAGTCCGG GAAATCGTTT AATACAGGCT ATGAAGAATA ATAATCTGCC AATATTGCTG GCTAATATTC AACCACTTCC CGATGTTACA TCAAACTGGG ATTGGATTTT GTGCACAACT AATCAGGGAA TTCATGGAAT ACAGGATACA AGTAATGTTC TCATGCGTTG TGGTATTAAA CCGGCTATTA TAACAGATGA TTGGAAGGCT GAATCCTTTA AAGCCTACTT TGAAGATTGG GCATTGGCTG CCAACACGCA TAACAGACTA AAAAAGACAA AGGTTGCGAT TTTCGGCCGT ATGCACAATA TGGGTGACAT ACTTGGTGAT GATGCGGCAT TGTGCAGAAA ATTTGGTGTA GAGGCAAACC ATGTAACAAT CGGTCCGGTT TATTACAACA TGGAAGGATT GTCAGATAAA GAAGTAGATG CCCAGATTGA GGAAGATAAA AAGAATTTTA AAATTGATCC TAATCTTCCT GAAGAAAGTC ATCGGTATGC TGCACGTATG CAATTAGCCT TTGAAAAATT CCTTAATGAT AACGGTTATG AAGGTTTTTC ACAGTTCTTC AACATATACA AGGAAGACGG CAGGTTCAAA CAAATACCGA TATTGGCAGG CTCCAGTCTC CTTGCAAAAG GTTATGGTTA TTCGGCGGAA GGTGATACAA ATGTACTTCT CATGACTGTG ATCGGTCACA TGATGATAGG GGATCCTCAT TTTACTGAGA TGTACTCCCT GGACTTTGGT AAGGATTCAG CAATGCTAAG CCATATGGGA GAAGGCAACT GGAAGGTTGC AAGGAAGGAT CGCGGAGTGA CACTGATTGA CAGGCCTCTT GATATTGGTG GTCTTGGTAA TCCTCCGACA CCAAAGTTCA ACGTAGAACC AGGAACAGCT ACCCTTGTTT CCCTCGTTGC AGTAGAAGGA GAAAAATACC AACTAATTGT ATCAAAGGGT ACTATCCTTG ATACTGAGGA CTTGCCAGAT GTTCCTATGA ACCATGCTTT TTTCAGACCG GATTCCGGCA TCAAAAAGGC TATGGACGAA TGGTTAGCTA ATGGTGGTAC ACATCACGAA GTACTATTCC TGGGTGATTT TAGAAGACGT TTTGAATTAT TATGTAAAAT TCTTGACATA AAATATATTG AAGTGTAA
|
Protein sequence | MITKQKPRIG FLGLMQGLYD ESQPELPKMQ EAFAREVVEQ LKDVADIDFP GPAKEREDIE RYVKYFNDKE YDGIMIVNLL YSPGNRLIQA MKNNNLPILL ANIQPLPDVT SNWDWILCTT NQGIHGIQDT SNVLMRCGIK PAIITDDWKA ESFKAYFEDW ALAANTHNRL KKTKVAIFGR MHNMGDILGD DAALCRKFGV EANHVTIGPV YYNMEGLSDK EVDAQIEEDK KNFKIDPNLP EESHRYAARM QLAFEKFLND NGYEGFSQFF NIYKEDGRFK QIPILAGSSL LAKGYGYSAE GDTNVLLMTV IGHMMIGDPH FTEMYSLDFG KDSAMLSHMG EGNWKVARKD RGVTLIDRPL DIGGLGNPPT PKFNVEPGTA TLVSLVAVEG EKYQLIVSKG TILDTEDLPD VPMNHAFFRP DSGIKKAMDE WLANGGTHHE VLFLGDFRRR FELLCKILDI KYIEV
|
| |