Gene Ccel_1543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1543 
Symbol 
ID7310307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1878421 
End bp1880769 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content37% 
IMG OID643608472 
Productcellulosome anchoring protein cohesin region 
Protein accessionYP_002505875 
Protein GI220928966 
COG category[R] General function prediction only 
COG ID[COG3401] Fibronectin type 3 domain-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000430269 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTACA AGAAAGTACT GTTTACGTGT TTTTTTCTCC TAGCATTAGT TATTCCAGTA 
TCAGCTAATG CCACTACGGA AAAAAATATA ATTGATTTAT CTAAATGGCA AAATGTAGGT
ATAAATTATA GTGGCTCAGA GCCAGCACCA ATTTGGAAAG TGAATTCGAC AAATACAGCT
GTAACGCAGA CTTTAAATGC CAGACCGGCA GTTTTTATGG GTGATATGGA GTGTGCGAAT
AATTCAGTGC GTGGAAGTTT TAGTGTCGAT ACAACTTCAG ATAACGACTT TGTAGGTTTT
GTTTTTGGCT ATAAGGATTC AGGACATTTC TATTTGTTTG ATTGGAAGCA ATCGGATGAA
GATTTTCATG GACTAGGAAA ACAAGGTATG AGTATTAAGA AAGTAAATAC GAATACATCT
CTTAGTGACA GTGACTTGTG GCCGACTTCT AATACTGATA AGGTTAAAGT TTTATATCAT
AATAACATTA CATATAAGGA TCAAATGCTT TATAATTTTA CATTAAATTT TACAGATAAA
GGTAGCTTTA ATATTGTAAT AAAGCAGGGG AATACAGTTT TGGACAGTAT CACCATAAAG
GATAGCACAT ACACTTCTGG TAAATTCGGT TTTTATAATT ACTCTCAAGA AATGGTTACA
TACAGTGTAT CTGAATTTGA AAAACTCCCC CCTGTGGTGA AAGTTACTTC AACTCAAAAT
GAAAAGGTTG ATTTATCCTG GACAGCTGTA GAAGGTGCGA CCAGCTACAA TATAAAACGT
GATACTACTT CTGGTGGCCC ATATACAACA ATCGGACAAA GCACTTCGAC GACATATACT
GACACAACAG TAGCCAATGG AACAACATAC TACTATGTTG TAACTGCTGT AAATACCGGA
GGAGAGAGTG AAAATTCCAA TGAAGTATCT GCAAAGCCAA TAGCACCTGC AAAAGCACCA
ATAAACCTTG TAGCAAAAGC TAATAATGCA AAAGTTGATT TAGTTTGGTC TGCATCTCAA
TCTGCGACCA GCTACAATAT AAAACGTGCT ACTACTGCTG GTGGCCCATA TACAACAATC
GGACAAAGCA CTTCGACAAC ATATACTGAC ACAACAGTAG CCAATGGAAC AACATACTAC
TATGTTGTAA CTGCTGTAAA TGCTGGAGGA GAGAGTGAAA ATTCCAATGA AGTATCTGCA
AGGCCAATAG CACCTGCAAA AGCACCAATA AACCTTGTAG CAAAAGCTAA TAATGCAAAA
GTTGATTTAG TTTGGTCTGC ATCTCAATCT GCGACCAGCT ACAATATAAA ACGTGCTACT
ACTGCTGGTG GCCCATATAC AACAATCGGA CAAAGCACTT CGACAACATA TACTGACACA
ACAGTAGCCA ATGGAACAAC ATACTACTAT GTTGTAACTG CTGTAAATGC CGGAGGAGAG
AGTGAAAATT CCAATGAAGT ATCTGCAAAG CCAATAGCAC CTGCAAAAGC ACCAATAAAC
CTTGTAGCAA AAGCTAATAA TGCAAAAGTT GATTTAGTTT GGTCTGCATC TCAATCTGCA
ACCAGCTACA ATATAAAACG TTCTACCACT GCTGGTGGCC CATATACAAC AATCGGACAA
AGCACTTCGA CAACATATAC TGACACAACA GTAGCCAATG GAACAACATA CTACTATGTT
GTAACTGCTG TAAATGCCGG AGGAGAGAGT GAAAATTCTA ATGAAGTATC AGCAACTCCC
ACCAATCCAA CAGTAACCCT TGAAGTAACT TCAGTTGACA AAGCAAAGCT GGACGATGAA
ATAACTGCAA ACATAGTAAT TCATAACGCT GTAAACATAT GTGCCGAAGA CCTAAAAATA
TCTTATGATA CTTCCAAACT ACAATTCATA AATGCGGAAA ATGCAGATGG CATGAAAATT
TATAAAGAGG ATGATATCGC TGCCGGAGTA AAAAGATACA TAACGGCTTG TCTCGGTAAA
GCCAATGCTG CAAATGGCGA TAAAATATTA CTAAAATTGA AGTTCAAAGC TATTGATAAG
GGTGAAGCAA AGATTGATAT AACAAACGGA CGTATTGCAG ATAATGTAAC TCTGGAAATG
GATGTTTCAC AGGAAAACTG TGGCGAAAAG ACAATTTTGA TAGAAGGGGT AAAAGACGTT
AACCGTACAG GTGAATACAC TCTTTTGGAT TTAGGTATTG ATGCATGGTA TTACGGGTAT
GCCGCAGTTG ACACAGATAC CAGCAAATAT GACGCCGACC AGATAATCAA TGGATCAATT
GACGATGATG ACTTAACTGA AATAGTAGCT CAAATACTTG CTAATACGAA CTATTCTGCG
AATAAATAA
 
Protein sequence
MNYKKVLFTC FFLLALVIPV SANATTEKNI IDLSKWQNVG INYSGSEPAP IWKVNSTNTA 
VTQTLNARPA VFMGDMECAN NSVRGSFSVD TTSDNDFVGF VFGYKDSGHF YLFDWKQSDE
DFHGLGKQGM SIKKVNTNTS LSDSDLWPTS NTDKVKVLYH NNITYKDQML YNFTLNFTDK
GSFNIVIKQG NTVLDSITIK DSTYTSGKFG FYNYSQEMVT YSVSEFEKLP PVVKVTSTQN
EKVDLSWTAV EGATSYNIKR DTTSGGPYTT IGQSTSTTYT DTTVANGTTY YYVVTAVNTG
GESENSNEVS AKPIAPAKAP INLVAKANNA KVDLVWSASQ SATSYNIKRA TTAGGPYTTI
GQSTSTTYTD TTVANGTTYY YVVTAVNAGG ESENSNEVSA RPIAPAKAPI NLVAKANNAK
VDLVWSASQS ATSYNIKRAT TAGGPYTTIG QSTSTTYTDT TVANGTTYYY VVTAVNAGGE
SENSNEVSAK PIAPAKAPIN LVAKANNAKV DLVWSASQSA TSYNIKRSTT AGGPYTTIGQ
STSTTYTDTT VANGTTYYYV VTAVNAGGES ENSNEVSATP TNPTVTLEVT SVDKAKLDDE
ITANIVIHNA VNICAEDLKI SYDTSKLQFI NAENADGMKI YKEDDIAAGV KRYITACLGK
ANAANGDKIL LKLKFKAIDK GEAKIDITNG RIADNVTLEM DVSQENCGEK TILIEGVKDV
NRTGEYTLLD LGIDAWYYGY AAVDTDTSKY DADQIINGSI DDDDLTEIVA QILANTNYSA
NK