Gene Cthe_2399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2399 
Symbol 
ID4811051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2864783 
End bp2866453 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content46% 
IMG OID640107812 
Productformate-tetrahydrofolate ligase 
Protein accessionYP_001038794 
Protein GI125974884 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTCACGG ATATTCAAAT AGCCCAATCA TGCAAAATGA AGCCCATAAC TCAAGTTGCG 
GCAGAGCTTG GCATTGATGA GGAAGAACTT GAGCTTTACG GAAAATACAA AGCAAAATTA
TCCGACAAGC TCTGGGAAAG AGTAAAGGAC AGGCCTGACG GCAAACTTGT TCTGGTGACT
GCGATAAACC CCACACCGGC CGGTGAGGGA AAGACCACCA CCACCGTCGG ACTGGGTCAG
GCCATGGCAA GAATCGGGAA AAAAGCAGTG ATTGCTTTAA GAGAACCATC TTTAGGTCCC
GTAATGGGAA TAAAAGGCGG AGCCGCCGGA GGAGGATACT CCCAGGTAGT TCCCATGGAA
GACATAAATT TGCATTTTAC CGGAGACATG CACGCGATAA CCGCTGCCAA CAACTTGTTG
TCAGCCGCTA TCGATAATCA TATACAGCAG GGAAACGAGC TTAATATTGA CGTAAGACAG
ATAATATGGA AAAGAGCAAT GGACATGAAC GACCGGGCGC TAAGAAACAT TGTGGTGGGT
TTAGGCGGCA AAGCAAACGG TGTGCCCAGG GAAGACGGTT TCCAAATAAC GGTGGCGTCG
GAGGTTATGG CTGTTTTATG CCTCTCAACC GGACTTATGG ACTTAAAAGA GCGCCTTGGA
AGAATACTGA TTGGGTACAC TTATGACGGA AAACCGGTCT TTGCAAAGGA TTTAAAGGTA
AACGGCGCAA TGGCTCTGCT TTTAAAAGAT GCCATAAAGC CAAATCTAGT TCAAACCCTG
GAAAACACTC CTGCAATAGT GCACGGAGGT CCTTTTGCCA ACATAGCCCA CGGCTGCAAC
AGCATTGTTG CCACCCGGCT TGGTTTGAAA CTTGCAGATT ACTGTATCAC AGAAGCCGGC
TTCGGTGCCG ACCTGGGTGC GGAAAAGTTT TTCAACATCA AGTGCCGCTA TGCCGGATTA
AAGCCTGATT TGGTCGTGCT GGTGGCCACC ATAAGGGCTC TTAAGTATAA CGGCGGTGTG
AAAAAAGAGA ATCTGGGAAT TGAGAACCTT CCGGCACTTG AAAAAGGATT TGTCAATCTT
GAAAAGCATA TAGAAAACAT CAGAAAGTTC CAGGTTCCGC TTCTTGTTGC CATCAACCAT
TTTGACACCG ACTCCGAAGC TGAAATCGAA TATGTTAAAA ACAGATGCAA AGCCTTAAAC
GTAGAAGTTG CTTTCTCGGA TGTCTTCTCA AAAGGTTCCG AAGGTGGTAT AGAGCTTGCC
GAAAAAGTTG TAAAACTTAC CGAAACACAA AAGTCAAATT TCAAACCTCT GTACGACGTC
AATCTTTCCA TAAGGGAAAA AATAGAGATA ATTGCCAGGG AAATTTACGG TGCGGACAGT
GTCAACATTT TGCCGGCAGC CGAAAGAGCA ATCAAAAAAA TTGAAGAGCT TAAAATGGAC
AAGCTGCCCA TATGTGTAGC CAAGACACAG TACTCCCTTT CCGACGATCC AACCCTTTTG
GGAAGGCCGC AGGGGTTTGT CATCACAGTG AGGGAAATAA AGCTTTCCAG CGGAGCAGGA
TTTATTGTGG CAATTACCGG GGACATCATG ACAATGCCAG GTCTTCCCAA AGTTCCCGCC
GCAGAAAAAA TCGATATAGA CGAAAACGGA GTTATTACAG GTCTCTTTTA A
 
Protein sequence
MLTDIQIAQS CKMKPITQVA AELGIDEEEL ELYGKYKAKL SDKLWERVKD RPDGKLVLVT 
AINPTPAGEG KTTTTVGLGQ AMARIGKKAV IALREPSLGP VMGIKGGAAG GGYSQVVPME
DINLHFTGDM HAITAANNLL SAAIDNHIQQ GNELNIDVRQ IIWKRAMDMN DRALRNIVVG
LGGKANGVPR EDGFQITVAS EVMAVLCLST GLMDLKERLG RILIGYTYDG KPVFAKDLKV
NGAMALLLKD AIKPNLVQTL ENTPAIVHGG PFANIAHGCN SIVATRLGLK LADYCITEAG
FGADLGAEKF FNIKCRYAGL KPDLVVLVAT IRALKYNGGV KKENLGIENL PALEKGFVNL
EKHIENIRKF QVPLLVAINH FDTDSEAEIE YVKNRCKALN VEVAFSDVFS KGSEGGIELA
EKVVKLTETQ KSNFKPLYDV NLSIREKIEI IAREIYGADS VNILPAAERA IKKIEELKMD
KLPICVAKTQ YSLSDDPTLL GRPQGFVITV REIKLSSGAG FIVAITGDIM TMPGLPKVPA
AEKIDIDENG VITGLF