Gene Ccel_2100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2100 
Symbol 
ID7310801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2456773 
End bp2458803 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content33% 
IMG OID643609034 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_002506425 
Protein GI220929516 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGTAA CAAAAATATG CAGAAAAATA AAAAATACAA ATATAAGACT TCGCATACAA 
CTGCTTATAG TACTTATAAC AGTAACACTT ATTCCAATAA TCGCAGTAAG TACTACAACT
TACATCACAA CCATAGGAAA AATAACTGAA CTCTCATTAA ATAATCTTAA ATCTGATTCC
TATAATACTA TGAGTAATAT AGAACTAAAA ATAAAAAGTT TGGACAGTAC AATCAAGGGA
GTGGCTTCCC AGACAGACTT TTTGGTTGGA CTTGAAATGG CCAATAGTGT TAATGAAAAG
ATGGATACAG TAACATATAG CGGAATTCAG CTCTCGATGA AAAATGTAGT AGAAGGATCA
GAAGGACTCA TTCAAACAAT GTATTTATGC AATAAAAGGG GAAAAGTAAT AGCCACAGCT
GCAAAAAGAA CAAAAACAGC AGGTGTAGAT GATTTTTATA ATATGCAGCT TTTTGAAAGT
ATAAAAAAAG ATGCAAACAA TGAAGTTATA GTCGGTAATT CTATTATTTT AAAAGGTGCA
AACAAAAAGG TTATACCTGT TACGAGAGCA GTTAAAAGTC TGGCAGGTTT TAGCGGAACC
ATAACTGCAC TGGTGGACTA TGAGAAATTC TTCAAATTTG GTAAGAACCA AATTGAAAGC
GAAATTATTA TATTGGACAG CGGACTAAAA GCTTTTTACG GCAAGGACAA TAATGAAATA
AACAGCAAAA TACCAATTAA AGAAGCTGTG GATGATGAAA ATATTACATA TATGGATTCA
GGAACCAAAA AAATTGCACA CTTGTATAAA TCAGATCTTA CAAACTGGAT AGTGTGTGCT
CAAATGGACT ATAGCAAAGT AATGTTACCT GTAAAGCAAT ATATTCTCAT CCTAATAATC
GTTTTGGTTT TATCACTTCT GCTTGCTGCA TTTATTTCCA TCTTTTATTC AAAATATATA
TCAAGTCCTG TAGTTGAACT CACACGACAA ATAAAGAAAG TTGAAGATGG CTTTCTGGAA
GTTCATTTTG AAAAAAGAAG CAATATATCA GAAATAAACA GTTTAACAAC TGCATTTGAA
AATATGGTTA GAAATTTAAA TATACTTATT TCCGGTATTA GCTCGGCTTC CAAGGAAATA
GACGAAATGT CCGCCCTTAT GTATAGTGAG GCCAGTGAGT CATTTGAAAA GTCAGAATTT
ACTCAAAAAT CAATCTCAAA CATAAATGTT AATATAAAGG ACCAAGCGGA CAATACAAGC
AACGCGACGG TGGAAATAAA AAGTCTTGCA GAACAGATTG CTACAACAAG GGAACATTCA
AACAATGTCT ACAACTTTCT TGACAGGCTT AACAATTCAG CAAAAAGAGG TAAAAGCCAA
ATGGATAAGC TGGAGGCAAA TTCTACACTA AATCTGCAAA GCATTAGTAA AATGAATGAA
ATGATAATTG GGCTCCAGAC ACAAATGAAA CAGATAAATA CTATAACTGC TGCAATTCAG
AGTGTAGCTA AACAGACACA GTTGTTGTCA CTTAATGCAA GGATAGAGGC TTCAAGGGCA
GGGGAATCGG GAAAAGGCTT TGCTGTAGTG GCTGATGAAA TTAAGGAACT ATCTATTCAG
ACAAACTCAC AAGCAGGAGT AATTAGAAAT ATGATTGAGA GTATTGTACA AAATTCAAAC
AACCTGACTA AGGGCTTTGA AGAGGTAAGC AAAGGAACTG ATTCTCAAAA TAGCTGTATT
AATGAAACAA AAGACTGCTT TCTGGAAATC AAAAAGAACA TTGATAATAT AAATAGCCGT
CTTTTTAATA TAACAGATTA TTTACAGGAA ATGGATAAAC AGAAAGACAA TCTTGTATTA
CTGGTAAATC AAATAAATAA CGCTGCCGTA GAGATAGCAC ACAGTTCTGA CCATGTTCAT
GAATACACTA AAAACCATAT TATTTCTGTA AAAAAAGTCC ATGAAAAATC AAACATATTT
AAGAGCTTAT CCCAAAAACT GAATTCATCT GTAGGATTAT TTAAAGTTTA G
 
Protein sequence
MSVTKICRKI KNTNIRLRIQ LLIVLITVTL IPIIAVSTTT YITTIGKITE LSLNNLKSDS 
YNTMSNIELK IKSLDSTIKG VASQTDFLVG LEMANSVNEK MDTVTYSGIQ LSMKNVVEGS
EGLIQTMYLC NKRGKVIATA AKRTKTAGVD DFYNMQLFES IKKDANNEVI VGNSIILKGA
NKKVIPVTRA VKSLAGFSGT ITALVDYEKF FKFGKNQIES EIIILDSGLK AFYGKDNNEI
NSKIPIKEAV DDENITYMDS GTKKIAHLYK SDLTNWIVCA QMDYSKVMLP VKQYILILII
VLVLSLLLAA FISIFYSKYI SSPVVELTRQ IKKVEDGFLE VHFEKRSNIS EINSLTTAFE
NMVRNLNILI SGISSASKEI DEMSALMYSE ASESFEKSEF TQKSISNINV NIKDQADNTS
NATVEIKSLA EQIATTREHS NNVYNFLDRL NNSAKRGKSQ MDKLEANSTL NLQSISKMNE
MIIGLQTQMK QINTITAAIQ SVAKQTQLLS LNARIEASRA GESGKGFAVV ADEIKELSIQ
TNSQAGVIRN MIESIVQNSN NLTKGFEEVS KGTDSQNSCI NETKDCFLEI KKNIDNINSR
LFNITDYLQE MDKQKDNLVL LVNQINNAAV EIAHSSDHVH EYTKNHIISV KKVHEKSNIF
KSLSQKLNSS VGLFKV