Gene Ccel_2443 details

Gene Information

Locus tag: Ccel_2443
Symbol:
ID: 7311114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2947209
End bp: 2948582
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 35%
IMG OID: 643609373
Product: major facilitator superfamily MFS_1
Protein accession: YP_002506752
Protein GI: 220929843
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
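
The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes 1-based inclusive coordinates and that the 457 aa figure excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2947209, 2948582)
print(length)                  # 1374
print(protein_length(length))  # 457
```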


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000052902
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAACAT GTTTGGGCAA GTTAGATTTT CTTAAAAAAA CCGAGGTGTA CATAGCAGTG 
AATTTATTGA CGAAGTTCAA GGAAATATAT TTTAATAAAG AAAGTCTTTC AGAAGAAGAT
ATGAGGGTGT CAAGGAATTT ATCTATTTTT GAGGGCTGTA CCGCCAGAAG TATTCTTACC
CTAACCAGCG GAGCTTTTTT AGTCGGATTC GCCAAGTATC TTGGTGCTAG CGATGAAAAA
GCTGGAATTA TAGCTGCAAT TCCTGTATTA GCAGGAATAG TAACGGTTTT TTCCCCTATA
GTAATTGAGA AGCTGGAGAG CAGGAAATTA CTGACCTGTA TGCTATGCTT TATTGGAAGA
TTAATGATGG GGCTTATGAT ACTTATACCT TTCATAAGTC CATACAAAAC AGTAAGGGTT
CAATTGTTGA TATGGGTATT CTTCATTGCA AACTTAATCC TGGCTTTTAC AACTCCTTAT
GCACAGACGT GGTTGCTGAA TATAACCCCG AAAAGAATAA GAGGTGATTA TTATGGAAAA
CGGGAGTCAA TAGTTCTGGG TACCGTTACT GTTGTTACCC TTATTATGGG ACAGGTTCTC
GATAAATTTG AACGAATGGG ACAACAATTT ACCGGGTTTA TTGTATTATA TGCTTTTGTT
ATTGTTACCG CCATTATAAA CACTGTTTTG TTTTCAAAAA TTAAAGAACC CGTTAATCCT
GTTTTAAAAC CAGGGGTTTC ATTTAAAAAT TTATTTTCGC TACCTGTTAA AAACAAGAAT
TTCATGAAAA TAACTTTCAT AACTCTATTT TGGAATTTAG GTTATCAGAT AGCTTTCCCA
TTTACTTCGG TTTATATGGT ATCAATCCTT CATTTGAGAT ATGGACTTGT TACGGTAATG
GCTGTTCTGG CATCAATCAC AAGTGTAGTA TCCGTTAGGT TCTGGGGAAA AATTGCAGAT
AAAAAATCGT GGCTGTATAT TATGAAGCTT ATGATTGTTC TACAGATTTT AAGCTTTCTT
ACATGGTTTT TTATAAATCC AGATACGGTA TACATTTTAA TGCCTGTAGC TCATATACTT
GGTGGAGCTG CAATTTCAGG AGTAAATATC TCTGTGAATA ATTTGCAGTA CAGTTATTCA
CCTGCCGATA ATAAAACGGT ATACATGGGT TTTTCGTCGG CGGTAAATGG TATAATTGGA
TTTCTAGGAA CTCTAGCAGG TTCACTCTTC ATTAAGGTTA TGGATACCAG AGGAGTTTCT
CTTGGAGGGT TTTCAATCGG TAATATGCAG ATGCTCTTTT TAGCAGCGGT GATTGTTTTA
ATAGTAAGTA TGTTTGGCAT ATCCAAATTC AAATTTAGCA ATTCTAATAT TTAA
 
Protein sequence
MSTCLGKLDF LKKTEVYIAV NLLTKFKEIY FNKESLSEED MRVSRNLSIF EGCTARSILT 
LTSGAFLVGF AKYLGASDEK AGIIAAIPVL AGIVTVFSPI VIEKLESRKL LTCMLCFIGR
LMMGLMILIP FISPYKTVRV QLLIWVFFIA NLILAFTTPY AQTWLLNITP KRIRGDYYGK
RESIVLGTVT VVTLIMGQVL DKFERMGQQF TGFIVLYAFV IVTAIINTVL FSKIKEPVNP
VLKPGVSFKN LFSLPVKNKN FMKITFITLF WNLGYQIAFP FTSVYMVSIL HLRYGLVTVM
AVLASITSVV SVRFWGKIAD KKSWLYIMKL MIVLQILSFL TWFFINPDTV YILMPVAHIL
GGAAISGVNI SVNNLQYSYS PADNKTVYMG FSSAVNGIIG FLGTLAGSLF IKVMDTRGVS
LGGFSIGNMQ MLFLAAVIVL IVSMFGISKF KFSNSNI
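
The protein sequence above corresponds to the gene sequence read under translation table 11, whose codon-to-amino-acid assignments match the standard code (the two tables differ only in which start codons are permitted). The following is a minimal sketch, assuming the sequence is pasted as text in the space-separated 10-base blocks shown here; `clean`, `translate`, and `gc_fraction` are hypothetical helper names, not functions of any particular library.

```python
from itertools import product

# Codon table in TCAG order. Amino-acid assignments are those of the standard
# code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, dropping a single terminal stop if present.

    Assumption: the first codon is ATG, as in this record, so alternative
    start codons (e.g. GTG, TTG) need no special handling.
    """
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGTCAACAT GTTTGGGCAA GTTAGATTTT"))  # MSTCLGKLDF
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the protein sequence and the GC content listed in this record.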