Gene Ccel_3404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3404 
Symbol 
ID7311966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3952563 
End bp3954167 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content42% 
IMG OID643610308 
Producthypothetical protein 
Protein accessionYP_002507672 
Protein GI220930763 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.515631 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGGAA TGCCAATTTA TCAAGATTCT CCGACGAATC TAAAAAATCA GATTTTTGCA 
GCAAACGGAT CATCTGTAGT TAATGTTCAG GCTGACAATA CAGGTAGATT AAAGGTTGCA
ACCGACAGTT CGTCCCCACT CGCTGTTGAC GTAGATGAAG CTGTAAACAG TATAACTGTT
TATGGCAGTG ACGGTACAAG TAACCAAGTT TTAAGAACAA CTGCTACGGG TCAGCTGGAT
ATCAGGCCTC TTACCGTTTC AGATACTGTC AATGTGAGTA TTACTCAAGC AGATGACAGT
ATTACTGTCT ATGGTAATGA CGGTACTGCA AACCAGATAA TTAAAACTAA CTCCACAGGC
CAACTGGATA TCAGACCTCT GACTTCTTCT GATACCGTCA GCGTTGATGT TTCCCAAGCT
ACCGATAGTA TTGCTGTTTA TGGTAATGAC GGTACTGCAA ACCAGATAAT TAAAACTAAC
TCCACAGGCC AACTGGATAT CAGACCTCTG ACTTCTTCTG ATACCGTCAG CGTTGATGTT
TCTCAAGCTA CCGATAGTAT TGCTGTTTAT GGTAATGACG GTACTGCAAA CCAGATAATT
AAAACTAACT CCACAGGCCA ACTGGATATC AGACCTCTGA CTTCTTCTGA TACCGTCAGC
GTTGATGTTT CTCAAGCTAC CGATAGTATC GCTGTTTATG GTAATGACGG TACTGCAAAC
CAGATAATTA AAACTAACTC CACAGGCCAA CTGGATATCA GACCTCTGAC TTCTTCTGAT
ACCGTCAGTG TTGATGTTTC CCAAGCTACC GATAGTATCG CTGTTTATGG TAATGACGGT
ACTGCAAACC AGATAATTAA AACTAACTCC ACAGGCCAAC TGGATATCAG ACCTCTGACT
TCTTCTGATA CCGTCAGCGT TGATGTTTCC CAAGCTACCG ATAGTATTGC TGTTTATGGT
AATGACGGTA CTGCAAACCA GATAATTAAA ACTAACTCCA CAGGCCAACT GGATATCAGA
CCTCTGACTT CTTCTGATAC CGTCAGCGTT GATGTTTCTC AAGCTACCGA TAGTATCGCT
GTTTATGGTA ATGACGGTAC TGCCAATCAG ATAATTAAAA CTAACTCCAC AGGCCAACTG
GATATCAGAC CTCTGACTTC TTCCGATACC GTAAACGTTG ATATTTCTCA ATCTACCGAT
AGTATTGCTG TATACGGTAG TGACGGTACT GCCAATCACG CTTTATTAAC TGATTCGGCC
GGAATACTAC AGGTTAACAA TACCCGGACC TTTACAACTG CTACTCTTAC AACTTTAGAA
ACAACAGACA GCTATCAATA TACAACCCAA CAGGAGATTG CTCAACTGAA CACCTATCAG
TTCTTTGTAA AGAATACAGG AGATACAAAC AGTGTTACAC TTGTTGTTGA ATTGAGCCCA
AATGGTACAG ACTGGGTAGT TGACAGTGAC GAACGTCCGA TTACCTTCGG GGCTGCAACA
ATTATAACTT CGAACAAGTT CCTAAGATAT ATAAGATTAG GATACAAGTC CACAAGTACT
GGTGCCAGCA CAACTATAAG TGCTATTTTC CAAGGCCAAG GCTAA
 
Protein sequence
MPGMPIYQDS PTNLKNQIFA ANGSSVVNVQ ADNTGRLKVA TDSSSPLAVD VDEAVNSITV 
YGSDGTSNQV LRTTATGQLD IRPLTVSDTV NVSITQADDS ITVYGNDGTA NQIIKTNSTG
QLDIRPLTSS DTVSVDVSQA TDSIAVYGND GTANQIIKTN STGQLDIRPL TSSDTVSVDV
SQATDSIAVY GNDGTANQII KTNSTGQLDI RPLTSSDTVS VDVSQATDSI AVYGNDGTAN
QIIKTNSTGQ LDIRPLTSSD TVSVDVSQAT DSIAVYGNDG TANQIIKTNS TGQLDIRPLT
SSDTVSVDVS QATDSIAVYG NDGTANQIIK TNSTGQLDIR PLTSSDTVSV DVSQATDSIA
VYGNDGTANQ IIKTNSTGQL DIRPLTSSDT VNVDISQSTD SIAVYGSDGT ANHALLTDSA
GILQVNNTRT FTTATLTTLE TTDSYQYTTQ QEIAQLNTYQ FFVKNTGDTN SVTLVVELSP
NGTDWVVDSD ERPITFGAAT IITSNKFLRY IRLGYKSTST GASTTISAIF QGQG