Gene Ccel_0857 details

Gene Information

Locus tag: Ccel_0857
Symbol:
ID: 7309702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 975785
End bp: 977164
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 38%
IMG OID: 643607794
Product: protein of unknown function DUF214
Protein accession: YP_002505209
Protein GI: 220928300
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4591] ABC-type transport system, involved in lipoprotein release, permease component
TIGRFAM ID:
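
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 975785
end_bp = 977164

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be the stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1380 459
```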


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.20592
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAATTA TAGCTAAAAT AGCTTTAAGA AATATTTTTG CTAACAGGAG ACGCTCAATA 
CTTATAGGAA TAGTTATATT TATATGTGCG TTCCTCATTT TAGTATCAAA TTCAATGGCA
AACGGTGTTG AATTCCAGGT ACTAAAGGGT TATAAAAACA TCCAGTGTGC ACACGTAATA
GTGGGTTGGG AAAACCTTAA GAAGGTAAAT TCTTCAGATG CTACAAGACT TTTGTTCCTC
ACAAGCAATC CAAGTTTTGA ATTAACTAAG GATGCCGAGA ATCAAAGGGC CATAAATACA
TTGAATGAGT TTTTGGAAAA AAATAAGGAT AAGGTGGATG GATACTATCC AAGCATAAGA
AGAAGTATAA GTATCCTGTA TAACGGAGGT GAAGTACAAG ATTCACAGTT CTCTTTATAC
GGTTTGAACG CAAAAAGCAG TAAATTGATG ATTGATTCAA AAGCAATGAG CATGTATAAA
GGCGAACTTT TGTCAGAGGA TCCCAATGCT ATATGCATAA GCAAGCAAAA GGCGGAAGAA
GACAAATTGA ATCTAGGCGA TAAAGTTACT TTAGAGTCGA TAAATCCGGA CGGCTCAAGA
GGAACTCTGG ACTGCACCAT TAAAGGCATA TATGCCAACG GAGCGGGATG GGATAACATG
TACCTGTTTA TTAATGAGGA TACTGCAAAA CAACTTATGA AATACAAAGA CGGCTATTTC
GACCTTGGAA GGATATCCCT CAAAAATGAG AGTGATGCAA GCGAGTTTGC GAAAGACCTT
GATGCCGCTC TGATAAAAGA CGGTTCCAAG GTTTTGAGGG CTGAATCCTA TCTTCAGGCT
TCAAAACTTT ACCCTACAAT GTCCAAAAGC TTAAAAGGAC TATGCAATCT GTTCATACTT
ATATTCTTAG TAGTTATATC CATAGGACTC CGCTCTTCCA TAAGAATGAA CCTTTTCGAA
CGTATGAGAG AGTTTGGTAC TTTACGAGCA ATAGGTTACA GCAGACGTCA GTGCTTTTCG
ATAATATTTC TTGAAGTATT TTTCCTCTCA ATCATAGCAC TTTCGATTGC ATGCGGAATC
GCGGCAGTGT TGGTGAATAT GTTGGGTAAA TCGGGAGTTT ACTTGGGTAC TGGCCCTCTT
AGCTACTTCG GCGGCGAGCG GTTATATCCT AGCATGAAAC CTATGGATAT CAGTACAACA
TTCGGCATAA TAACATTATT CACATTGCTT TCTACTGTAA GCCCTGCACT TAAACTTTGT
TACCAGAATA TTACAAATAT AATGGTAAAG AACATGAAAA AAGTGAAGGT ATGGAGAACA
ATGTTTTTCG GAGAGAAACA CAGTAAAGTA AAATATGACG GACTTAATAA AGCAGTGTAA
 
Protein sequence
MGIIAKIALR NIFANRRRSI LIGIVIFICA FLILVSNSMA NGVEFQVLKG YKNIQCAHVI 
VGWENLKKVN SSDATRLLFL TSNPSFELTK DAENQRAINT LNEFLEKNKD KVDGYYPSIR
RSISILYNGG EVQDSQFSLY GLNAKSSKLM IDSKAMSMYK GELLSEDPNA ICISKQKAEE
DKLNLGDKVT LESINPDGSR GTLDCTIKGI YANGAGWDNM YLFINEDTAK QLMKYKDGYF
DLGRISLKNE SDASEFAKDL DAALIKDGSK VLRAESYLQA SKLYPTMSKS LKGLCNLFIL
IFLVVISIGL RSSIRMNLFE RMREFGTLRA IGYSRRQCFS IIFLEVFFLS IIALSIACGI
AAVLVNMLGK SGVYLGTGPL SYFGGERLYP SMKPMDISTT FGIITLFTLL STVSPALKLC
YQNITNIMVK NMKKVKVWRT MFFGEKHSKV KYDGLNKAV
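
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to produce this record: it builds the codon table from the compact string commonly used for the standard genetic code, whose amino acid assignments are the same in translation table 11 (table 11 differs in which codons may act as start codons, which this sketch does not handle). The function name `translate` is hypothetical.

```python
# Compact codon table: amino acid assignments shared by NCBI translation
# tables 1 and 11. Codons are ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in this record
    can be pasted in directly. Alternative start codons are NOT
    converted to methionine (a simplification).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGGAATTA TAGCTAAAAT AGCTTTAAGA"))  # MGIIAKIALR
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to the same function would give a complete translation to compare against the listed protein.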