Gene Ccel_2153 details


Gene Information

Locus tag: Ccel_2153
Symbol:
ID: 7312329
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2522834
End bp: 2524486
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 38%
IMG OID: 643609084
Product: YD repeat protein
Protein accession: YP_002506475
Protein GI: 220929566
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
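
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information listing.
start_bp = 2522834
end_bp = 2524486

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1653
print(protein_length)  # 550
```

Both results agree with the Gene Length (1653 bp) and Protein Length (550 aa) fields.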


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0957707
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATAC CTCGGAAGCG ATTGTATATA CAGGAGCCTG ACGGAAAATC CAATTATACT 
GAATATTTTC AATATGGAGG TAAAAACAAA TATGGCGATA TAAAATCACA AACCGACCGT
AATGGAAATA AAACTCAGTA TGAGATTGAT GCTAGAGGCA ATGTAACCAA AATCACAAAT
CCTGACGGAA GTACCCAGCT AAAAGAATAC GATGAAAAAA ACAATGTAAC AAAAGAAGTG
GATGAGTGCG GGAAAACAAC CTACAATGTT TACGATGAAA ACAAAATAAA CCTGATAAAG
AAGGTACAAC CCCTTAACGA AACAGATGTA TACGACGGCA CAAACAATAC AGGTTTTGCA
ATAACTTTCT ACCAATATTA TACAGGAGAA GAATCGGGTT CATCTGCAAA GGGATTATTA
AAAAGTGAAA CAGACCCGGA GGGCAATACC ACCACGTACA CCTACAACAC TTATGGTGAT
GTAAAAGCAG TATCTGACCC AGAGACAGGC AAAGTTACAA CTTATGAGTA TAACCGCATA
GGCTGGAAAA CCGCACAAAT AACCCAAAAG GGCAACAGAA CAGAATTTAC CTACGACAAA
AACGGCCAGT TAATTAAAAC TACAACAGTA AGCTCAAAAA ATGAAACACA AAGAACGATA
TTTGACCTGT TGGGAAGAAA AATACAAGAA ATTACTCCAA ACCAATATGA CGATACAAAG
GATAATGTAG AAGCCGATAC ATATACTGAC AATACGGTGG GAACAAAATA CGAATACTTT
GATAGTGGTA AAATCAAGGA AGTAACCAAT GCATTAGGAG AAACGACAAG CTACACCTAC
GACGTATACG GGAACACGCT GACAGAAGCA AAACCCAACG GTGCAATTTA CAGGTATGAA
TATGATGTTC TGGACAGACT ACTTAAAATT TACTTCAGAG ATAATTCATC AGTAGCTGAA
GAACTACTTA CCCAATACAG TTATGCAACT TTGGAGGACG GAAAAACACA AACAACTGAA
ACAAAATATC TGAATTCCAA AGACAAGGCT GTAACGGTTT ACATATATGA CTATGCCGAC
AGGCTTGTAG AACAGCAGAA TCCTGACGCT ACAAAGCAGA GGACAATATA CAACGCAAAC
GGAACAATTA ACAGACAGAT TGCAGCTAAC GGAAGCAGTA CATACTTCAA ATATGATGGC
TTAAACAGAT TGACAGAACA ATGGGCCCCT TTTGAAGTAT CAAACGGAAA TACCCTGTAC
ACTTACAACA AAACCGAATA TGACAAGGCT GGAAGAAAAT CTGCGGTGAA ATCAGGCAAA
GACAAGGTAA CCCTGTGGTC AATACCTGAA AGCCTTGCAA TAACAAACTA CCAGTATTAC
AAAAACGGTA ACGTCAGCCA GACAAGGGAT TCTGAAGGAA GAAAGACAGA ATACCTGTAC
GATGATGACG GAAATGTTAT AAAAGAAAGT GTATATACCA ATGCAACCAA CAAGCTAGTA
ACGGATTATA CATACAATTA CCTTGGAAAG CTGGACAAAA AGGAGCAACA TGTAAAAACC
GGAGACCTGT ACGGAAAAGA CTTCAACGAC AAACGGACAC TTTACTTACA ACCTCCTACA
CTTATGACAA GAATGGTAAT ACAAAAACTG TGA
 
Protein sequence
MKIPRKRLYI QEPDGKSNYT EYFQYGGKNK YGDIKSQTDR NGNKTQYEID ARGNVTKITN 
PDGSTQLKEY DEKNNVTKEV DECGKTTYNV YDENKINLIK KVQPLNETDV YDGTNNTGFA
ITFYQYYTGE ESGSSAKGLL KSETDPEGNT TTYTYNTYGD VKAVSDPETG KVTTYEYNRI
GWKTAQITQK GNRTEFTYDK NGQLIKTTTV SSKNETQRTI FDLLGRKIQE ITPNQYDDTK
DNVEADTYTD NTVGTKYEYF DSGKIKEVTN ALGETTSYTY DVYGNTLTEA KPNGAIYRYE
YDVLDRLLKI YFRDNSSVAE ELLTQYSYAT LEDGKTQTTE TKYLNSKDKA VTVYIYDYAD
RLVEQQNPDA TKQRTIYNAN GTINRQIAAN GSSTYFKYDG LNRLTEQWAP FEVSNGNTLY
TYNKTEYDKA GRKSAVKSGK DKVTLWSIPE SLAITNYQYY KNGNVSQTRD SEGRKTEYLY
DDDGNVIKES VYTNATNKLV TDYTYNYLGK LDKKEQHVKT GDLYGKDFND KRTLYLQPPT
LMTRMVIQKL
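
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates DNA codon by codon and computes GC content. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first and last sequence blocks from the record are used as input.

```python
# Codon table built in TCAG order; assumed to match the amino acid
# assignments of translation table 11 (same as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and final 33 bases of the gene sequence shown above.
first_block = "ATGAAGATAC CTCGGAAGCG ATTGTATATA"
last_block = "CTTATGACAA GAATGGTAAT ACAAAAACTG TGA"

print(translate(first_block))  # MKIPRKRLYI
print(translate(last_block))   # LMTRMVIQKL
```

The two outputs match the first and last ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.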