Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2399 |
Symbol | |
ID | 4811051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2864783 |
End bp | 2866453 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107812 |
Product | formate-tetrahydrofolate ligase |
Protein accession | YP_001038794 |
Protein GI | 125974884 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG2759] Formyltetrahydrofolate synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTCACGG ATATTCAAAT AGCCCAATCA TGCAAAATGA AGCCCATAAC TCAAGTTGCG GCAGAGCTTG GCATTGATGA GGAAGAACTT GAGCTTTACG GAAAATACAA AGCAAAATTA TCCGACAAGC TCTGGGAAAG AGTAAAGGAC AGGCCTGACG GCAAACTTGT TCTGGTGACT GCGATAAACC CCACACCGGC CGGTGAGGGA AAGACCACCA CCACCGTCGG ACTGGGTCAG GCCATGGCAA GAATCGGGAA AAAAGCAGTG ATTGCTTTAA GAGAACCATC TTTAGGTCCC GTAATGGGAA TAAAAGGCGG AGCCGCCGGA GGAGGATACT CCCAGGTAGT TCCCATGGAA GACATAAATT TGCATTTTAC CGGAGACATG CACGCGATAA CCGCTGCCAA CAACTTGTTG TCAGCCGCTA TCGATAATCA TATACAGCAG GGAAACGAGC TTAATATTGA CGTAAGACAG ATAATATGGA AAAGAGCAAT GGACATGAAC GACCGGGCGC TAAGAAACAT TGTGGTGGGT TTAGGCGGCA AAGCAAACGG TGTGCCCAGG GAAGACGGTT TCCAAATAAC GGTGGCGTCG GAGGTTATGG CTGTTTTATG CCTCTCAACC GGACTTATGG ACTTAAAAGA GCGCCTTGGA AGAATACTGA TTGGGTACAC TTATGACGGA AAACCGGTCT TTGCAAAGGA TTTAAAGGTA AACGGCGCAA TGGCTCTGCT TTTAAAAGAT GCCATAAAGC CAAATCTAGT TCAAACCCTG GAAAACACTC CTGCAATAGT GCACGGAGGT CCTTTTGCCA ACATAGCCCA CGGCTGCAAC AGCATTGTTG CCACCCGGCT TGGTTTGAAA CTTGCAGATT ACTGTATCAC AGAAGCCGGC TTCGGTGCCG ACCTGGGTGC GGAAAAGTTT TTCAACATCA AGTGCCGCTA TGCCGGATTA AAGCCTGATT TGGTCGTGCT GGTGGCCACC ATAAGGGCTC TTAAGTATAA CGGCGGTGTG AAAAAAGAGA ATCTGGGAAT TGAGAACCTT CCGGCACTTG AAAAAGGATT TGTCAATCTT GAAAAGCATA TAGAAAACAT CAGAAAGTTC CAGGTTCCGC TTCTTGTTGC CATCAACCAT TTTGACACCG ACTCCGAAGC TGAAATCGAA TATGTTAAAA ACAGATGCAA AGCCTTAAAC GTAGAAGTTG CTTTCTCGGA TGTCTTCTCA AAAGGTTCCG AAGGTGGTAT AGAGCTTGCC GAAAAAGTTG TAAAACTTAC CGAAACACAA AAGTCAAATT TCAAACCTCT GTACGACGTC AATCTTTCCA TAAGGGAAAA AATAGAGATA ATTGCCAGGG AAATTTACGG TGCGGACAGT GTCAACATTT TGCCGGCAGC CGAAAGAGCA ATCAAAAAAA TTGAAGAGCT TAAAATGGAC AAGCTGCCCA TATGTGTAGC CAAGACACAG TACTCCCTTT CCGACGATCC AACCCTTTTG GGAAGGCCGC AGGGGTTTGT CATCACAGTG AGGGAAATAA AGCTTTCCAG CGGAGCAGGA TTTATTGTGG CAATTACCGG GGACATCATG ACAATGCCAG GTCTTCCCAA AGTTCCCGCC GCAGAAAAAA TCGATATAGA CGAAAACGGA GTTATTACAG GTCTCTTTTA A
|
Protein sequence | MLTDIQIAQS CKMKPITQVA AELGIDEEEL ELYGKYKAKL SDKLWERVKD RPDGKLVLVT AINPTPAGEG KTTTTVGLGQ AMARIGKKAV IALREPSLGP VMGIKGGAAG GGYSQVVPME DINLHFTGDM HAITAANNLL SAAIDNHIQQ GNELNIDVRQ IIWKRAMDMN DRALRNIVVG LGGKANGVPR EDGFQITVAS EVMAVLCLST GLMDLKERLG RILIGYTYDG KPVFAKDLKV NGAMALLLKD AIKPNLVQTL ENTPAIVHGG PFANIAHGCN SIVATRLGLK LADYCITEAG FGADLGAEKF FNIKCRYAGL KPDLVVLVAT IRALKYNGGV KKENLGIENL PALEKGFVNL EKHIENIRKF QVPLLVAINH FDTDSEAEIE YVKNRCKALN VEVAFSDVFS KGSEGGIELA EKVVKLTETQ KSNFKPLYDV NLSIREKIEI IAREIYGADS VNILPAAERA IKKIEELKMD KLPICVAKTQ YSLSDDPTLL GRPQGFVITV REIKLSSGAG FIVAITGDIM TMPGLPKVPA AEKIDIDENG VITGLF
|
| |