Gene Ccel_0018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0018 
Symbol 
ID7308941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp18289 
End bp19959 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content40% 
IMG OID643606946 
Productformate-tetrahydrofolate ligase FTHFS 
Protein accessionYP_002504386 
Protein GI220927477 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCAAACTG ATATACAAAT TGCTCAGAGA TGTAAAATGC ATCATATTGC AGATATAGCA 
AAAAATCTCG GTATTGACAC CGAAGATATT GAGTTTTATG GTAATTATAA AGCCAAGTTA
TCAGATAAGC TTTGGGATAA GGTTAAGAAT AAAAAGGATG GCAAGCTGGT TCTTGTAACC
GCTATAAACC CTACTCCGGC AGGAGAAGGA AAAACCACAA CCACTGTGGG TCTTGGACAA
GCTATGGCAA GGATAGGTAA AAATGCAGTT ATAGCTTTGA GAGAACCATC AATGGGTCCG
GTAATGGGTA TTAAGGGAGG AGCTGCTGGA GGAGGTTACG CACAGGTTGT TCCCATGGAA
GACATAAACC TCCACTTTAC TGGAGATATG CATGCAATAA CTGCGGCAAA TAATTTGCTT
TCAGCTGCTA TAGACAACCA TCTTCAGCAA GGAAATATGT TGAATATTGA TTCCCGTCAA
ATTGTCTGGA AGCGTTGCAT GGATATGAAC GACAGGGCAT TAAGAAACGT AATTGTAGGA
CTCGGGGGAA AAATAAACGG TGTCCCAAGA GAAGACGGTT TTAACATTAC CGTCGCTTCT
GAAATAATGG CAATTCTCTG TCTGGCACTT GATATTAAAG ATCTCAAGAA AAGGCTTGGA
CGTATAATTA TTGGCTATAC TTACGAAGGC AAACCTGTAA CAGCCCATGA CTTAAAGGTT
GATGGTGCAA TGACACTGCT ACTGAAGGAT GCTATTAAGC CTAACCTTGT ACAAACACTT
GAAGGAACCC CTGCTTTAAT GCACGGCGGA CCTTTTGCAA ATATAGCTCA CGGTTGTAAT
AGTATTTCAG CAACAAAACT TGCACTGAAA CTGAGTGACT ACGTTATTAC CGAAGCAGGC
TTTGGTGCAG ACCTTGGTGC AGAGAAGTTT TTTGATATTA AGTGTAGATT TGCAGGATTC
AAGCCGGATG CAGTTGTTCT TGTTGCTACA ATAAGGGCTC TAAAATATAA CGGCGGTGTA
AGAAAAGAAG ACCTGAAAGA AGAGAATATT GACGCTTTAT CCAAGGGCTT TGCAAATGCA
GAGAAGCATA TCGAAAATCT GAAACAGTTT GGTGTACCTG TTATGGTTGC CATTAATCAT
TTTGATACCG ATACCGAGGC TGAAATAAAG CTGATTCAGG AAAAATGTAG TTCTCTAGGT
GTCGAGGTCG CCTTTTCAGA TGTATTTTTA AAAGGCGGTG AAGGCGGAAT AGAGCTGGCA
GAAAAGCTTG TGGCACTAAC AGATTCTACT GTTTCAAATT TTGCACCTAT ATATGATGAA
AAACTCCCCA TAAAGGAAAA AGTTCAACAA ATAGTTTCAA AGATTTACGG AGGCAGAAAC
GTTATTTATA ATGCGGCCGC AGAAAAGTCT ATTGCTAAGA TAGAAGAAAT GGGACTGGAC
AGACTTCCTA TTTGTATGGC AAAAACTCAG TATTCTCTAT CTGATAATCC TGCACTTCTT
GGGAGACCCC AAGACTTTGA CGTAACAGTA AAGGAAGTTC GGATTTCTGC AGGAGCCGGG
TTTTTAGTAG TACTTACCGG AGATATTATG ACAATGCCCG GTCTGCCAAA GGTACCGGCA
GCAGAAAGAA TTGATATAAA TGAATCGGGT GTTATTACTG GACTATTTTA A
 
Protein sequence
MQTDIQIAQR CKMHHIADIA KNLGIDTEDI EFYGNYKAKL SDKLWDKVKN KKDGKLVLVT 
AINPTPAGEG KTTTTVGLGQ AMARIGKNAV IALREPSMGP VMGIKGGAAG GGYAQVVPME
DINLHFTGDM HAITAANNLL SAAIDNHLQQ GNMLNIDSRQ IVWKRCMDMN DRALRNVIVG
LGGKINGVPR EDGFNITVAS EIMAILCLAL DIKDLKKRLG RIIIGYTYEG KPVTAHDLKV
DGAMTLLLKD AIKPNLVQTL EGTPALMHGG PFANIAHGCN SISATKLALK LSDYVITEAG
FGADLGAEKF FDIKCRFAGF KPDAVVLVAT IRALKYNGGV RKEDLKEENI DALSKGFANA
EKHIENLKQF GVPVMVAINH FDTDTEAEIK LIQEKCSSLG VEVAFSDVFL KGGEGGIELA
EKLVALTDST VSNFAPIYDE KLPIKEKVQQ IVSKIYGGRN VIYNAAAEKS IAKIEEMGLD
RLPICMAKTQ YSLSDNPALL GRPQDFDVTV KEVRISAGAG FLVVLTGDIM TMPGLPKVPA
AERIDINESG VITGLF