Gene Ccel_2834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2834 
Symbol 
ID7311454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3387912 
End bp3389105 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content45% 
IMG OID643609729 
Productphage major capsid protein, HK97 family 
Protein accessionYP_002507108 
Protein GI220930199 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.174812 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA TTCTTGAATT GCGTGAGAAA CGCGCTAAAG CTTGGGACGC AGCAAAGGCA 
TTCCTTGATT CAAAACGTGG CGGTGACGGA CTGTTATCCG CCGAGGACAC GACAACCTAT
GAAAAAATGG AAGCCGATGT GGTGGCTCTT GGTAAGGAAA TTGAGCGTTT AGAACGCCAA
GCATCTATCG ACTTAGAACT GTCCAAAGCA ACCAGTAACC CAATTACGAA CGAACCTACT
AGAACTGGAG AGGAAAAGAC CGGCCTTGCA AGTGCTGAAT ACAAAAAAGC TTTCTGGAAT
GCGATGCGTG ACAATGTCAG CTATGAAGTA AGGAACGCTC TAAAGATTGG CACTGATTCT
GAAGGTGGAT TTCTTGTACC TGATGAGTTT GAGCGTACCC TAGTAGAAGC TCTAGAGGAA
GAAAATATTT TCCGTAGATT GGCCAATGTA ATCACTACAT CTTCTGGTGA CCGCAAGATT
CCTGTTGTTG CAAGCAAAGG CAATGCAAGC TGGATCGATG AAGAAGGGGC TATTCCAGAG
AGTGATGACA GCTTCGGTCA AGTATCCATC GGTGCTTATA AACTAGCAAC GATGATTAAA
GTCTCTGAGG AACTGCTAAA TGATTCCGTG TTTAATCTCG AAAGCTACAT CACAAGAGAA
TTCGCACGTC GCATTGGTAA CAAGGAGGAA GAAGCCTTCT TTGTAGGTGA TGGCACAGGT
AAGCCAACAG GAATTCTAAA TGCCACAGGC GGTGGTCAAG TTGGTGTTAC TGCGGCAAGT
GCCACTGCCA TCACTTTGGA TGAGGTATTA GATTTATTCT ACAGCTTAAA AGCACCGTAT
CGTAATAAGG CAGTATTCGT AATGAACGAT GCCACTATAA AGGCTATTCG TAAATTGAAA
GACGGTAATG GGCAATACCT ATGGCAACCT TCCATCCAAG CGGGAACACC TGATACGATT
CTTAACCGCC CGCTGTATAC CTCATCATAT GTACCTACTG CTGAAGCTGG TGCAAAGACT
GTGGTATTCG GTGATTTTAG TTATTACTGG GTGGCAGACC GTCAAGGACG AGTATTCAAA
CGCTTAAATG AACTCTATGC TGTCACAGGT CAAGTAGGAT TTATTGCGAC TCAACGAGTT
GACGGAAAGC TTATCTTACC GGAGGCCGTT AAGGTACTCC AACAGAAAGC CTAA
 
Protein sequence
MSKILELREK RAKAWDAAKA FLDSKRGGDG LLSAEDTTTY EKMEADVVAL GKEIERLERQ 
ASIDLELSKA TSNPITNEPT RTGEEKTGLA SAEYKKAFWN AMRDNVSYEV RNALKIGTDS
EGGFLVPDEF ERTLVEALEE ENIFRRLANV ITTSSGDRKI PVVASKGNAS WIDEEGAIPE
SDDSFGQVSI GAYKLATMIK VSEELLNDSV FNLESYITRE FARRIGNKEE EAFFVGDGTG
KPTGILNATG GGQVGVTAAS ATAITLDEVL DLFYSLKAPY RNKAVFVMND ATIKAIRKLK
DGNGQYLWQP SIQAGTPDTI LNRPLYTSSY VPTAEAGAKT VVFGDFSYYW VADRQGRVFK
RLNELYAVTG QVGFIATQRV DGKLILPEAV KVLQQKA