Gene Ccel_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1033 
Symbol 
ID7309855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1285554 
End bp1286951 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content40% 
IMG OID643607960 
ProductL-arabinose isomerase 
Protein accessionYP_002505375 
Protein GI220928466 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2160] L-arabinose isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000049306 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAACCA AACAAAAACC AAGAATCGGA TTTTTGGGCC TAATGCAGGG ATTGTATGAC 
GAATCACAGC CGGAACTGCC GAAAATGCAG GAGGCATTTG CCAGAGAAGT GGTTGAACAA
TTAAAAGATG TGGCAGATAT TGATTTTCCC GGTCCAGCAA AAGAAAGAGA AGATATAGAA
AGATATGTAA AATATTTCAA TGATAAAGAG TACGATGGAA TAATGATAGT AAATCTGTTG
TACAGTCCGG GAAATCGTTT AATACAGGCT ATGAAGAATA ATAATCTGCC AATATTGCTG
GCTAATATTC AACCACTTCC CGATGTTACA TCAAACTGGG ATTGGATTTT GTGCACAACT
AATCAGGGAA TTCATGGAAT ACAGGATACA AGTAATGTTC TCATGCGTTG TGGTATTAAA
CCGGCTATTA TAACAGATGA TTGGAAGGCT GAATCCTTTA AAGCCTACTT TGAAGATTGG
GCATTGGCTG CCAACACGCA TAACAGACTA AAAAAGACAA AGGTTGCGAT TTTCGGCCGT
ATGCACAATA TGGGTGACAT ACTTGGTGAT GATGCGGCAT TGTGCAGAAA ATTTGGTGTA
GAGGCAAACC ATGTAACAAT CGGTCCGGTT TATTACAACA TGGAAGGATT GTCAGATAAA
GAAGTAGATG CCCAGATTGA GGAAGATAAA AAGAATTTTA AAATTGATCC TAATCTTCCT
GAAGAAAGTC ATCGGTATGC TGCACGTATG CAATTAGCCT TTGAAAAATT CCTTAATGAT
AACGGTTATG AAGGTTTTTC ACAGTTCTTC AACATATACA AGGAAGACGG CAGGTTCAAA
CAAATACCGA TATTGGCAGG CTCCAGTCTC CTTGCAAAAG GTTATGGTTA TTCGGCGGAA
GGTGATACAA ATGTACTTCT CATGACTGTG ATCGGTCACA TGATGATAGG GGATCCTCAT
TTTACTGAGA TGTACTCCCT GGACTTTGGT AAGGATTCAG CAATGCTAAG CCATATGGGA
GAAGGCAACT GGAAGGTTGC AAGGAAGGAT CGCGGAGTGA CACTGATTGA CAGGCCTCTT
GATATTGGTG GTCTTGGTAA TCCTCCGACA CCAAAGTTCA ACGTAGAACC AGGAACAGCT
ACCCTTGTTT CCCTCGTTGC AGTAGAAGGA GAAAAATACC AACTAATTGT ATCAAAGGGT
ACTATCCTTG ATACTGAGGA CTTGCCAGAT GTTCCTATGA ACCATGCTTT TTTCAGACCG
GATTCCGGCA TCAAAAAGGC TATGGACGAA TGGTTAGCTA ATGGTGGTAC ACATCACGAA
GTACTATTCC TGGGTGATTT TAGAAGACGT TTTGAATTAT TATGTAAAAT TCTTGACATA
AAATATATTG AAGTGTAA
 
Protein sequence
MITKQKPRIG FLGLMQGLYD ESQPELPKMQ EAFAREVVEQ LKDVADIDFP GPAKEREDIE 
RYVKYFNDKE YDGIMIVNLL YSPGNRLIQA MKNNNLPILL ANIQPLPDVT SNWDWILCTT
NQGIHGIQDT SNVLMRCGIK PAIITDDWKA ESFKAYFEDW ALAANTHNRL KKTKVAIFGR
MHNMGDILGD DAALCRKFGV EANHVTIGPV YYNMEGLSDK EVDAQIEEDK KNFKIDPNLP
EESHRYAARM QLAFEKFLND NGYEGFSQFF NIYKEDGRFK QIPILAGSSL LAKGYGYSAE
GDTNVLLMTV IGHMMIGDPH FTEMYSLDFG KDSAMLSHMG EGNWKVARKD RGVTLIDRPL
DIGGLGNPPT PKFNVEPGTA TLVSLVAVEG EKYQLIVSKG TILDTEDLPD VPMNHAFFRP
DSGIKKAMDE WLANGGTHHE VLFLGDFRRR FELLCKILDI KYIEV