Gene Ccel_3066 details


Gene Information

Locus tag: Ccel_3066
Symbol:
ID: 7312444
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 3609595
End bp: 3610830
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 40%
IMG OID: 643609968
Product: hypothetical protein
Protein accession: YP_002507338
Protein GI: 220930429
COG category:
COG ID:
TIGRFAM ID:
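
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information record.
START_BP = 3609595
END_BP = 3610830


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends with one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode a residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1236
print(protein_length(length_bp))  # 411
```

Both results agree with the Gene Length (1236 bp) and Protein Length (411 aa) fields.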


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGAACA CTCAAAGGGC CCACGCAATA CTCTCGGCTT CAGGTTCAAA ACGATGGCTT 
AGTTGCCCAC CAAGTGCAAA GCTCGAAGAA CAATTCCCGG AAAGTACAAG CGAATTCGCA
GAAGAAGGTA CATATGCCCA CAGCTTCGCA GAATTAAAGC TCCGAGGATA TATAACTACG
GACCTCAAGC CAAGTGTCTA TAAAAAGAAA TTAGCAGAGA TTAAGAAAGA TCCTTTTTAT
AGTCAAAGCT TAGATGATTA CATAGAACAA TATATCAATA TTGTGGGTGA GAAATACCTT
GCTGCAAAAA AGAATAGTTC AGATTCTTTT GTAATGCTCG AGCAGAAACT TGATTTTTCA
GAATGGGTTC CTGATGGTTT TGGAACTGGG GACGTAGTTT TGATATCTCC AGGGATTCTT
GAAATTGTGG ACCTGAAATA TGGACAAGGT GTCCCTGTAT CCGCCGAAGG AAATACCCAA
ATGCGATTAT ATGCCCTTGG TGCACTTAAT CAATATGGTA TGTTGTATGA CTTCGATAAA
ATCAAAATGA CTATTATTCA GCCTAGACTT GACAGTATAT CCGAGGATGA AATAACGGTT
CAGGAATTGC TTGACTGGGG AGAGTCAGTT GTTAAGCCTA CTGCGGATAT GGCAATTGCC
GGTGAAGGAG AATTCAAGTC GGGAGATCAC TGTCAGTTTT GTAGAGCTAA AGCAGTATGC
AGAAAAAGGG CTGAGGATAA TCTTGAAATG GCTAGATACG AATTCGAAGA TCCTAATATC
TTATCAAATG ATGAGATAGC AGATATTCTA GCTAAAGCAG CAGAGCTCCA AAAATGGGCA
TCAGATGTAC AAGCCTATGC ACTCGATCAA GCAGAGAATC ATGGGGTTAA ATTTACTGGT
TGGAAGCTGG TCGAGGGTAG AAGTAATAGA AAATATACGG ATGAAGATGC TGTGGCTACA
AAGCTGAAAG ACGAGGGTTA CGCATCGGAT GTTATATACC AGCCACAAAA AATCTGGGGT
ATTAGCGAAA TGGAGAAAAA GATTGGTAAA AGGCTTTTCG CTGACTATCT TACTGAATTT
GTTGTTAAAC CAGCAGGTAA AGCAACTCTT GTTCCAGAGA GTGATAAACG CCCGGAGATA
TCATCCGTAG CATCAGCAGT AAGAGATTTT GATGACCTTT ATGAAAACAA GCTTCAACAT
GAAACAGAAA AAATTCCAGA CGATATTTTA AATTGA
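
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content is defined as the share of G and C among all bases and that the sequence is pasted in the space-separated 10-mer layout used here; `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and case are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Usage: pass the full gene sequence as one string (line breaks are fine)
# and round the result to compare it with the GC content field.
first_block = "ATGGGGAACA"
print(gc_percent(first_block))
```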
 
Protein sequence
MGNTQRAHAI LSASGSKRWL SCPPSAKLEE QFPESTSEFA EEGTYAHSFA ELKLRGYITT 
DLKPSVYKKK LAEIKKDPFY SQSLDDYIEQ YINIVGEKYL AAKKNSSDSF VMLEQKLDFS
EWVPDGFGTG DVVLISPGIL EIVDLKYGQG VPVSAEGNTQ MRLYALGALN QYGMLYDFDK
IKMTIIQPRL DSISEDEITV QELLDWGESV VKPTADMAIA GEGEFKSGDH CQFCRAKAVC
RKRAEDNLEM ARYEFEDPNI LSNDEIADIL AKAAELQKWA SDVQAYALDQ AENHGVKFTG
WKLVEGRSNR KYTDEDAVAT KLKDEGYASD VIYQPQKIWG ISEMEKKIGK RLFADYLTEF
VVKPAGKATL VPESDKRPEI SSVASAVRDF DDLYENKLQH ETEKIPDDIL N