Gene Ccel_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1020 
Symbol 
ID7309844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1269972 
End bp1272251 
Gene Length2280 bp 
Protein Length759 aa 
Translation table11 
GC content44% 
IMG OID643607947 
Producthypothetical protein 
Protein accessionYP_002505362 
Protein GI220928453 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAAA AACTATGGTA CAAATCACCG GCAAAGGAAT GGAATGAAGC ACTACCAATA 
GGTAATGGAA GACTTGGGGC TATGGTATAC GGCTGTGTGA AAAATGAAAA TATACAGCTT
AATGAGGACA GCATCTGGTA CGGTGATCCC ATAGACAGAA ACAATCCAGA CGCACTGGCC
AATTTAGCTG AAATACGTAA CTTCTTGTCA GATGGAAGGA TTAAAGAGGC CGAAAAGCTA
GCGGTTCTGT CCCTTTCCGG AGTACCGGAG AGTCAGAGGC CATATCAGAC TTTGGGGAAC
TTAAAACTAA ATTTTGAGAT TGATGAGAGT GACATAAGAG ATTACTCTAG AGAGCTGGAT
ATTGAAAATG CATGTGCTTC TGTCAAATTT GTTTCAAAGG GAGTTATGTA TACCAGAGAG
TATTTTGCAA GTGCAGTCGA TCAGGTCATA GTTGTTCGTC TGTTTGCCGA TGCCCCCGGC
AAGATAAGCT TCACAGCCAA TATGAGAAGA GGAAGGTTCC TTGATAATTC TGGGGCTATA
GATGGTAAAA CCATCGGAAT GTTTGCAAGC TGTGGTAGCG ACAAGGGTGT AAGATTTTGT
TCAATGGTCA GGGCGGTTTC CGAGGGCGGG AAAGTCAATA CCATCGGTGA AAATCTGATT
GTTGAGGAAG CCGATGCCGT AACTTTGCTT ATTTCTACTG CTACCAGCTT CTATCACAAG
GAATATGAAA CACAGTGTCT TAAATATCTT GACGGAGTAG AAGAAAAAAC ATATACAGAG
CTGATGTCAA ACCACATTGA GGATTATTCT CAATTATACG GAAGAGTTGA GCTTGAAATA
GGAAATGCTG AAGAGCATGA CAAAATTCAA AGCCTGGATA CGGCTGAGAG ATTGGAGAGA
CTTGAGAGCG GAAAACCCGA CCACCAGCTA GAATGCCTTT ATTTCAGCTT CGGAAGATAT
CTGCTTATAT CTTGCAGCCG CCCGGGAAGT CTGCCTGCAA ATTTGCAGGG TATCTGGAAC
CAGGATATAC TTCCTGCTTG GGATAGTAAA TATACTATTA ATATAAATAC CGAAATGAAC
TACTGGCCTG CAGAGACATG TAATCTTTCG GAGTGCCACT TTCCGCTATT TGATCACATT
GAAAGGATGA GAGCACCAGG CAGAAGAACC GCTAGGGTAA TGTACGGATG CAGCGGTTTT
GTAGCACACC ATAATACCGA CATATGGGGA GATACGGCTC CACAGGATAT CTACATTCCG
GCAACTTACT GGCCAATGGG TGCAGCGTGG CTCTCACTTC ACCTGTGGGA ACATTATGAA
TTCGGTTTAG ACAAGGAGTT TTTGAAAGAT GCCTATCCCG TAATGAAAGA GGCAGCCCAA
TTTTTTCTTG ACTTCTTAAT CGAGGACAGT AAGGGAAGAC TTGTAACAAG TCCTTCTGTT
TCACCGGAGA ATACATATAT ACTGGAAAAC GGTGAGAAGG GCTGCTTATG TATCGGACCT
TCCATGGACA GCCAGATATT ATATGCACTT TTCAGCGGAT GTATAGAAGC TTCAAATATA
CTTGATACCG ACATATCCTT TGCGGAGAAG CTGATAAAAG TCAGGGACAG CCTCCCAAAG
CCCCAGATAG GACGATACGG GCAAATTCAG GAATGGTCAG AGGACTATGA GGAGGAAGAG
CCCGGACACA GGCATATATC ACATTTGTTT GGCTTACATC CGGGAAAGCA GTTCAGTACG
AGGAAAACGC CCGAACTGGC AACAGCAGCC AGAAAAACTC TCGAAAGGCG GTTGGCCAAT
GGCGGAGGTC ATACAGGGTG GAGCAGAGCT TGGATAATTA ATATGTGGGC CAGACTGAAG
GACGGAGAAA AGGCATATGA AAATGTTGTG GACCTTTTGA AAAAATCCAC TTTGCCAAAT
TTGTTTGACA ACCATCCGCC ATTCCAGATA GATGGCAATT TCGGAGGTGC TGCAGGTATA
GCGGAAATGC TTTTGCAAAG TCATGAGGGC GGTATAGAAT TTCTCCCTGC TCTTCCGGGT
GCCTGGAGTG AAGGAAGGGT AAAAGGCTTA GTTGCCCGTG GGAATTTCGA GGTGGAAATG
GAATGGAAGG ACGGCAAGCT CAACCGTGCC ACTATCCTTT CACGCAGCGG CGGAAACTGC
AAGATATTCA CTTCACTTAA ATATCGGGTG ACAAGTGACG GAAAACCTGT GGATACTGTA
CAAGACGGAC AAGTTATGTC TTTTACTACC ACTGAGGGTA AGAAATATGT AATTGAGTAA
 
Protein sequence
MKQKLWYKSP AKEWNEALPI GNGRLGAMVY GCVKNENIQL NEDSIWYGDP IDRNNPDALA 
NLAEIRNFLS DGRIKEAEKL AVLSLSGVPE SQRPYQTLGN LKLNFEIDES DIRDYSRELD
IENACASVKF VSKGVMYTRE YFASAVDQVI VVRLFADAPG KISFTANMRR GRFLDNSGAI
DGKTIGMFAS CGSDKGVRFC SMVRAVSEGG KVNTIGENLI VEEADAVTLL ISTATSFYHK
EYETQCLKYL DGVEEKTYTE LMSNHIEDYS QLYGRVELEI GNAEEHDKIQ SLDTAERLER
LESGKPDHQL ECLYFSFGRY LLISCSRPGS LPANLQGIWN QDILPAWDSK YTININTEMN
YWPAETCNLS ECHFPLFDHI ERMRAPGRRT ARVMYGCSGF VAHHNTDIWG DTAPQDIYIP
ATYWPMGAAW LSLHLWEHYE FGLDKEFLKD AYPVMKEAAQ FFLDFLIEDS KGRLVTSPSV
SPENTYILEN GEKGCLCIGP SMDSQILYAL FSGCIEASNI LDTDISFAEK LIKVRDSLPK
PQIGRYGQIQ EWSEDYEEEE PGHRHISHLF GLHPGKQFST RKTPELATAA RKTLERRLAN
GGGHTGWSRA WIINMWARLK DGEKAYENVV DLLKKSTLPN LFDNHPPFQI DGNFGGAAGI
AEMLLQSHEG GIEFLPALPG AWSEGRVKGL VARGNFEVEM EWKDGKLNRA TILSRSGGNC
KIFTSLKYRV TSDGKPVDTV QDGQVMSFTT TEGKKYVIE