Gene Ccel_2933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2933 
Symbol 
ID7311547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3491121 
End bp3492497 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content41% 
IMG OID643609833 
ProductO-antigen polymerase 
Protein accessionYP_002507207 
Protein GI220930298 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000205816 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA AAACTGCTGT AATATTATCA ATATTGGGAT TGCTGGCAGG GGTATGCGGA 
GCTTTTGCAC CGACCTTTCT GCTGATTGCA TTTATTGCAG GCGTAATATT TACTGCCGCA
ATGATGTTTG ATTATTCAAA ATTCCTATAT GTTCTAGGGT TTTATGTTTT AATTGACTAC
GTATTGAGGT ATGTCATTTC AAGTGCTTTG CTTGCAAGCT TCTGGGATGA ACTACTGTTT
ATATTTGCAC TGTGCTTGTG GCTCTACAAA TGGGTAGTTT ACCGTAAACA GGATGGCTAT
GTTGCAACAC CTTTGGATAT ACCTCTTGTA TTTTTTGTGG TAATCTCCAT ATGTTTGCTT
CTTGTTAATT CACCTGTTAT GGCCATCGGA ATTGCCGGAT TCAGACAGGT TATACAGCAG
ATGCTATGGT ATTTTATTGT TGCACAGCTT GTAACCTCCA GCAGAAACAT CAGATGGTTT
TTGTACATTA TGGTTTTTAT CGGAGGACTT CTTGGGCTTC ATGGTATTTA CCAATATGTT
ACCCATGCAG AGATGCCGTC TTATTGGGTT GACAGGCTTG AATCGGGTAT AACAACAAGG
GTTTTCTCCA TTATTGGAAG CCCTAATATT CTGGGAAGCC TTATGGTACT GCTGATACCC
GTTGCGATTT CATTTGTTTT CAGTGAAAGA AAGATTTTCA AGAAAATCAT ATTTACCGGA
ATAACCCTTG CAATGTCTGC AACCCTTATA TTCACATCCT CAAGGTCCGC TTGGATAGGA
TTTGTAGTAG CAATGGGCGT GTATTTCTGG CTTAAAGACA AGAGGCTTAT TTTATTGCTG
GCTCTGCTGG TTGTTGCCGC ATATTTTGCA GTACCTACCA TTGCACACAG AGTAAACTAT
CTGTTAAGTC CTCAATATAT GATAAGCAGT GCAGCTGGTG GAAGAATTGC AAGATGGTCC
ATAGGTATTG CGGCACTTCA GCAGCATCCA TGGTTCGGCC TCGGACTTGG ACAATTCGGC
GGTGCAGTTG CACAGAACTT TAAGATACCG GGTGCATTCT ATGTCGACAG CTATTTTTTA
AAGATTGCTG TTGAAATGGG TATTGTTGGC TTAACATCCT TCTGTATTCT TATATATAAC
GCTCTGGCAT GGGGAATACG TGCGGTTAAG AGAACAGCAG ACCGACAAAG CCTCAGCATG
GCACAGGGTG TTTTTGCGGG AATGGTTGGC ATTGTGGTTC CGAACTTTGT AGAAAATGTA
TTTGAGGTTC CAATGATGAC AGCTTATTTC TGGATGTTTG CAGCCGTCCT GATAGCCCTC
GGCTTCACCC TTCCTAATAA GGGGTTAAAC CGTCTTAACG TAGGAAGTAT AAGATAG
 
Protein sequence
MNKKTAVILS ILGLLAGVCG AFAPTFLLIA FIAGVIFTAA MMFDYSKFLY VLGFYVLIDY 
VLRYVISSAL LASFWDELLF IFALCLWLYK WVVYRKQDGY VATPLDIPLV FFVVISICLL
LVNSPVMAIG IAGFRQVIQQ MLWYFIVAQL VTSSRNIRWF LYIMVFIGGL LGLHGIYQYV
THAEMPSYWV DRLESGITTR VFSIIGSPNI LGSLMVLLIP VAISFVFSER KIFKKIIFTG
ITLAMSATLI FTSSRSAWIG FVVAMGVYFW LKDKRLILLL ALLVVAAYFA VPTIAHRVNY
LLSPQYMISS AAGGRIARWS IGIAALQQHP WFGLGLGQFG GAVAQNFKIP GAFYVDSYFL
KIAVEMGIVG LTSFCILIYN ALAWGIRAVK RTADRQSLSM AQGVFAGMVG IVVPNFVENV
FEVPMMTAYF WMFAAVLIAL GFTLPNKGLN RLNVGSIR