Gene Ccel_2789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2789 
Symbol 
ID7311414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3348169 
End bp3349281 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content34% 
IMG OID643609688 
Producthypothetical protein 
Protein accessionYP_002507067 
Protein GI220930158 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000791032 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAACTA AATTATTTAA GAAAGTCATA ACACTTGCTT TAAGTTCTAT TATTCTACTT 
TCATCTATGG CATATGCGGC TGTAGGTCAA CCATATGACA GATTAAGTAT GTCATTTACC
GGGAATCGTT ATATCTACTC AAATAATCCG GAAAAAATTA CAACTACACT CATAGCTGGA
ACAACTGGAA AAAATATAGT AAACAGGACT GTTACCCCAG GTCAAAAATA TGATGTTGAG
TATAGTCACA CAAACATGAC CACAATTCCT TTAACAGTGG CTGTTATTTT AAAGAATAAT
ACAAATGTGA AAGCTAATAT TGCTATCTAT AACAATGCTG CTTATCAAAA TGAAGCATAT
GATGTGGTTG GTTCAAAGAC TGAATTTGAT TACTGGAGTA TTGCAAAATA TTCGACAAAG
ACCATTGATC CAGGAAAGTC ATGCGTACTT ATTCGTTCTA ATCTCAATGC TGCTGATTAT
TCTACCGGTG TGGGAAAAGT TCTTTTCCAA TCTGATGTTG CTGTAAACTG TAGAGTTGCT
TATTTTAAAA CCGGAACTTC AGATGATGCA GCAGGCAATC TTAGTGACTT AACAGCAGGA
GATACCGTTT CTACAACTAC TAATGAATGT CTTAATGATG GAAGAACAAT AATTTATGAT
TATGCGTCAG CTATTAATAA AGCATTCTAC CTTAATATGG ATGTTGCATA TACAGATTCA
ACAGGAAAAG TCTTAGATCC ACATAATAAA TATAATGCAA ATGAGTTCCA GAAACCTGTG
TGGTACCTAC CATCTTCAAG AGAATACTCA CAAGGTAATT GGGGCATTAA TTACAATATG
CGTTTAGATA ATGCTGGTGG AAAAACTTTA TATATTATTC CAGATTGGAC TAATATAAAA
TACCTCGGAT CTAACTCTTA TACTATTTAT GACCCATATA CCTCTTCATG GAAGAACATC
AAACTTTATA AAAGCTCTTT GGATTATGCT GTAATAAAAC TTCCGAATAA ATCAAGCATG
ATTTTTAATT TCGTTTTAGC CGGAGGGGAT TGCGGACAAG AATATTTTAC ATTTACTGCT
CCGACAACAG GACATGCAGT ACCCGTTAAC TAA
 
Protein sequence
MRTKLFKKVI TLALSSIILL SSMAYAAVGQ PYDRLSMSFT GNRYIYSNNP EKITTTLIAG 
TTGKNIVNRT VTPGQKYDVE YSHTNMTTIP LTVAVILKNN TNVKANIAIY NNAAYQNEAY
DVVGSKTEFD YWSIAKYSTK TIDPGKSCVL IRSNLNAADY STGVGKVLFQ SDVAVNCRVA
YFKTGTSDDA AGNLSDLTAG DTVSTTTNEC LNDGRTIIYD YASAINKAFY LNMDVAYTDS
TGKVLDPHNK YNANEFQKPV WYLPSSREYS QGNWGINYNM RLDNAGGKTL YIIPDWTNIK
YLGSNSYTIY DPYTSSWKNI KLYKSSLDYA VIKLPNKSSM IFNFVLAGGD CGQEYFTFTA
PTTGHAVPVN