Gene Ccel_2012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2012 
Symbol 
ID7310721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2373103 
End bp2374347 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content35% 
IMG OID643608945 
ProductParallel beta-helix repeat protein 
Protein accessionYP_002506338 
Protein GI220929429 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAT TGAATAAACT ATTTTTTTCA ATACTAATAG TTCTAATGGC AATAGGATTA 
ATTCCTGTAA ATAGTCAAAA AAATATTGTA TATGCAGAAA CATATAACTC GACAGATACC
TCAGTAGAGG CAGATACTAT TCATGTTAGT GGAATTATTT CTGAAAACAC TACATGGACT
AACAACTATA CTTATGTAGT TGATGGAACC ATAATTGTTC AACAAGGTGT AAAACTTCAG
ATTGATCAAG GAGTAATGGT GAAGTTTACC CGGGGAACAG AAATTGTTGT AAATGGTACA
CTGACAGCTT CAGGGACTGA ATTGGACAAA GTTGTATTTA CAAGCAATAA TGATACGGCA
TATGGAGGAA GCGAAGTAAC AGGATACTAT GATTACTGGC TTGGAATAAC TGTATCAAGT
ACGGGAGAAT TCAATGGTGA TTATATAAAA GTTAGGTATG CAGGGGCAGA TTCCAATTAT
TATACTCCTC ATTGTGCAAT TAATGTTCAA GGAAAGCTTA ACCTTACAAA TTCAGAAGTA
AGTAACTCTA AAAATTATGG TATATACCTG AATACAAGCT TTGATTCAGC AATAAAAAAC
AATAATATAA TTAGTAATCG ATCAACTGGG ATATATATTA ACAACACAAG TGCAGATGCC
ACTAATACCA TGGATATTGA GAATAATACC ATATCTGGTA ATGGCGGATG TGGAATTTAT
GTATCTCAGG CTGGGACAGG AAATGCTGTA ATAGAAGGAA ATAGGATAGC GGGCAATGGA
GGGTCAGGCA TCTATATAGA TATATTTGGG ACAGGAAATT TAAGTGTAAG AAATAACAAT
CTGTTAAATA ATACGGAAAG TGGAATATAT GTTTATCTGG GAGGATTAAG TTCATCAATA
TTCACAGGAA TAGCCGACAA TACCTATGAA GGAAATACCA TAAGAGGAAC ATTGTGTAAC
GGAGTGGGTA TTGGAGGAAA CACAATTGTG GATATAACAC TAAGTAATGC TGTGTACTAT
TTAGCTGATT TTGTATTGGT ACCAAACGAT AAAAAATTGA CGGTACAGCC AGGAACAATA
ATAAAAAGCA GTTCAAAAGG AAGTAGTATC TACGTATATG GAAAACTCAA TGCCTTGGGA
ACGAAAGGCA GTCCTATTGT ATTTACAAGC CATAAAGATG CGGCATATGG AGGAAGTGGG
ATAGCCGGAT ACTATGATTA TTGTGACCTT TCCCTGATAA ATTAG
 
Protein sequence
MSKLNKLFFS ILIVLMAIGL IPVNSQKNIV YAETYNSTDT SVEADTIHVS GIISENTTWT 
NNYTYVVDGT IIVQQGVKLQ IDQGVMVKFT RGTEIVVNGT LTASGTELDK VVFTSNNDTA
YGGSEVTGYY DYWLGITVSS TGEFNGDYIK VRYAGADSNY YTPHCAINVQ GKLNLTNSEV
SNSKNYGIYL NTSFDSAIKN NNIISNRSTG IYINNTSADA TNTMDIENNT ISGNGGCGIY
VSQAGTGNAV IEGNRIAGNG GSGIYIDIFG TGNLSVRNNN LLNNTESGIY VYLGGLSSSI
FTGIADNTYE GNTIRGTLCN GVGIGGNTIV DITLSNAVYY LADFVLVPND KKLTVQPGTI
IKSSSKGSSI YVYGKLNALG TKGSPIVFTS HKDAAYGGSG IAGYYDYCDL SLIN