Gene Information |
Locus tag | Athe_2208 |
Symbol | |
ID | 7408404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2338858 |
End bp | 2340516 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643716575 |
Product | Formate--tetrahydrofolate ligase |
Protein accession | YP_002574055 |
Protein GI | 222530173 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG2759] Formyltetrahydrofolate synthetase |
TIGRFAM ID | |
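
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are inclusive, 1-based positions and that the final codon is a stop codon that is not counted in Protein Length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2338858
end_bp = 2340516

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; a CDS length should be divisible by 3.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1659 0 552
```

Under these assumptions the result agrees with the listed Gene Length (1659 bp) and Protein Length (552 aa).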
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000668109 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGCA TCTCAAAAGT AGAAGAAGTA CTCGAACCCA TATCCAAGAT TGCAGAAAAA ATAGGACTTG ACGAAGATGA AATTGAGCTT TATGGAAAAT ACAAGGCAAA GATAAGTTTG GATATTCTTA AGAAAAAAGC ACAATTACAA GAGGGCAAGG TTATTTTAGT GACATCTATT AACCCAACAC CTTTTGGAGA GGGGAAGACG ACAACTGCGA TTGGTCTTTC TATGGCAATA AACAGGCTGG GGTTTAAATC TATCGTTACT TTAAGAGAAC CTTCCTTAGG ACCGTTTTTG GGTTTAAAAG GTGGGGCAAC AGGTGGCGGC GCTTCTCAGA TTTTGCCCTC AATTGATATA AATCTTCACT TTACAGGAGA CATTCATGCA GTGACCTCTG CAAACAATCT TCTTTGCGCT GCTGTTGACA ACCACATTTA TCATGGAAAT AGACTTGGAA TAAATCCAAA GTCTATAACC ATAAAAAGAG CAATGGATAT GAATGATAGA AGTCTTCGGC ACATTATAGT TGGACTTTCA AATGACCAGA AAGGTGCTAT AAGAGAAGAT GGGTTTGTTA TCTCTGTTGC CTCTGAAGTG ATGGCAGTTT TGTGTCTTTC AATGAGCTAT GACGATCTAA AAGAAAAACT TGGAAATATA TTAGTAGGTT TTACCTATGA CAAAAAACCT GTGTATGCCA AGGATTTGAA TGTCCATGGG AGTATGGCTC TTTTATTAAA AGATGCACTA AAACCAAACC TTGTTCAAAC TTCTGAAAAT ACCGCTGCAA TTGTTCATGG TGGTCCTTTT GCAAATATTG CACACGGGAC AAATAGCATT GTTGCAACAA AAATTGCTCA AAAACTTTCT GAATATGTAG TTGTTGAGGC AGGTTTTGGG TCGGATTTAG GAGCAGAGAA GTTTATAAAT ATTGTTGCAA GAAAATCTGG AATATATCCA CAAGCTGCTG TTCTTGTTGT GACAGTTAAA GCATTAAAAC ATCATGCGAA GATTGAAGAA AATAGTGGTT TACAAAGTGG TGTAAATTCT ATTCAACAAG GACTTGAGAA TTTAGAAAAA CACATTGAAA ATCTCAAAGT CATGGGGCTT GAGACAGTGG TGGCTTTAAA TAAGTTTCCG GACGATAAAG ATGAAGAGAT TGAGCTTATC AGGTCTTTTT GTGAGGAAAT GGGTGTAGAA TTTTCAGTAT CAAGTGCATA TACTCACGGG TCAGAAGGTG TGCTTGAGCT TGCTGAAAAG GTTATAAGGT TGAGCGATAA AAGAAAAAGA ATAAACTTTG TTTACCAAGA CAGTGATTTT ATCGAGGAGA AAATTAAAAA AGTTGCAACC ATCATCTATG GCGCAAAAGA TGTAAAGTTT TCTAAAGCAG CTTTGTCAAA ACTTGAACTT ATAAAAAACC TCAAGGTTGA ACATTTTCCC ATTTGTATGT CAAAAACTCA GTATTCGCTT TCTGATGACC CGAAATTACT TGGAAAACCA AAAGATTTTA TATTAAATGT TACAGACATA GAAATAAAAA ATGGGGCTGG ATTTATAGTT GTCATGTGCG GTGATATAAT TGCAATGCCA GGGCTTGGAA AAGACTTTGC AGCTCTTCAT CTTGACATCG ACAGTAGCGG AAATCCCATT TTTAAATAA
Protein sequence | MKSISKVEEV LEPISKIAEK IGLDEDEIEL YGKYKAKISL DILKKKAQLQ EGKVILVTSI NPTPFGEGKT TTAIGLSMAI NRLGFKSIVT LREPSLGPFL GLKGGATGGG ASQILPSIDI NLHFTGDIHA VTSANNLLCA AVDNHIYHGN RLGINPKSIT IKRAMDMNDR SLRHIIVGLS NDQKGAIRED GFVISVASEV MAVLCLSMSY DDLKEKLGNI LVGFTYDKKP VYAKDLNVHG SMALLLKDAL KPNLVQTSEN TAAIVHGGPF ANIAHGTNSI VATKIAQKLS EYVVVEAGFG SDLGAEKFIN IVARKSGIYP QAAVLVVTVK ALKHHAKIEE NSGLQSGVNS IQQGLENLEK HIENLKVMGL ETVVALNKFP DDKDEEIELI RSFCEEMGVE FSVSSAYTHG SEGVLELAEK VIRLSDKRKR INFVYQDSDF IEEKIKKVAT IIYGAKDVKF SKAALSKLEL IKNLKVEHFP ICMSKTQYSL SDDPKLLGKP KDFILNVTDI EIKNGAGFIV VMCGDIIAMP GLGKDFAALH LDIDSSGNPI FK
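
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is shown in coding orientation (5' to 3', starting at ATG), which is consistent with the record even though the gene lies on the minus strand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so a standard codon table is used here. The `translate` helper is hypothetical, and only the first and last bases of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base slowest).
# Table 11 shares these assignments; it differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored,
    and trailing bases that do not form a full codon are skipped.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases and last 9 bases of the gene sequence listed above.
head = translate("ATGAAGAGCA TCTCAAAAGT")
tail = translate("TTTAAATAA")
print(head, tail)  # MKSISK FK
```

The two fragments match the start (MKSISK) and the end (FK) of the listed protein sequence; the terminal TAA is the stop codon.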
| |