Gene Information |
Locus tag | Moth_0109 |
Symbol | |
ID | 3831999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 106977 |
End bp | 108656 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637828043 |
Product | formate--tetrahydrofolate ligase |
Protein accession | YP_428991 |
Protein GI | 83588982 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG2759] Formyltetrahydrofolate synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCAAGG TACCCAGTGA TATTGAGATT GCCCAGGCAG CCAAAATGAA ACCGGTCATG GAACTGGCCC GGGGACTGGG CATCCAAGAG GACGAGGTCG AGCTTTATGG TAAGTACAAG GCCAAGATCT CCCTCGATGT CTATCGTCGC CTCAAAGACA AGCCTGACGG GAAACTAATC CTGGTAACCG CCATTACCCC TACTCCGGCC GGCGAAGGGA AAACTACTAC CAGTGTCGGT CTCACCGATG CCCTGGCTCG CCTGGGGAAA AGGGTGATGG TCTGCCTGCG GGAGCCCTCC CTGGGACCCA GCTTTGGTAT CAAAGGCGGT GCCGCCGGCG GTGGTTATGC CCAGGTAGTA CCCATGGAAG ATATCAACCT GCACTTCACC GGCGATATCC ACGCCGTCAC CTATGCCCAC AACCTGCTGG CGGCCATGGT GGATAACCAC CTGCAGCAGG GTAACGTCCT GAATATTGAT CCCCGTACCA TCACCTGGCG CCGGGTCATC GACCTTAATG ACCGGGCTCT GAGGAACATA GTCATCGGCC TGGGTGGCAA AGCCAACGGC GTACCGCGGG AGACAGGGTT TGACATCTCC GTTGCCTCGG AGGTTATGGC CTGCCTGTGC CTGGCCAGCG ACCTCATGGA TCTCAAGGAA CGTTTCAGCC GCATTGTTGT CGGCTACACC TATGACGGCA AACCGGTCAC CGCCGGCGAT CTGGAGGCCC AGGGTTCCAT GGCTCTTCTC ATGAAGGACG CCATTAAACC CAACCTGGTC CAAACCCTGG AGAATACGCC GGCCTTTATC CACGGTGGTC CCTTCGCCAA TATCGCCCAC GGTTGCAACA GCATTATCGC AACCAAGACG GCCCTGAAAC TGGCGGATTA TGTCGTGACG GAAGCCGGTT TCGGTGCCGA CCTGGGTGCC GAGAAGTTCT ATGACGTTAA ATGCCGTTAT GCCGGCTTTA AACCCGATGC CACAGTCATC GTGGCTACCG TCCGCGCCCT CAAGATGCAC GGCGGCGTAC CCAAATCAGA CCTGGCCACT GAAAACCTGG AAGCCCTGCG GGAAGGCTTT GCCAACCTGG AGAAACACAT CGAAAATATC GGCAAGTTCG GCGTACCGGC AGTCGTGGCC ATCAATGCCT TCCCCACCGA TACCGAGGCC GAGCTAAATC TCCTCTACGA GTTGTGCGCC AAAGCTGGGG CCGAAGTTGC CCTCTCGGAA GTCTGGGCTA AGGGCGGCGA AGGCGGTCTG GAACTTGCCC GGAAGGTGTT GCAGACCCTG GAGAGCAGGC CATCCAACTT CCATGTCCTC TACAACCTGG ACCTGAGTAT TAAAGACAAA ATTGCCAAAA TCGCCACCGA GATCTACGGG GCCGACGGCG TCAACTATAC GGCCGAAGCC GACAAAGCTA TCCAGCGTTA TGAATCCCTG GGCTACGGCA ACCTGCCGGT GGTCATGGCC AAGACCCAAT ACTCCTTTTC CGATGACATG ACCAAGCTCG GGCGGCCGCG GAACTTTACC ATCACCGTGC GCGAGGTGCG CCTCTCGGCC GGAGCAGGCT TTATCGTCCC CATCACCGGC GCCATAATGA CCATGCCCGG GCTGCCCAAA CGCCCGGCGG CCTGCAACAT CGACATCGAT GCCGACGGCG TCATTACCGG TCTTTTCTAG
|
Protein sequence | MSKVPSDIEI AQAAKMKPVM ELARGLGIQE DEVELYGKYK AKISLDVYRR LKDKPDGKLI LVTAITPTPA GEGKTTTSVG LTDALARLGK RVMVCLREPS LGPSFGIKGG AAGGGYAQVV PMEDINLHFT GDIHAVTYAH NLLAAMVDNH LQQGNVLNID PRTITWRRVI DLNDRALRNI VIGLGGKANG VPRETGFDIS VASEVMACLC LASDLMDLKE RFSRIVVGYT YDGKPVTAGD LEAQGSMALL MKDAIKPNLV QTLENTPAFI HGGPFANIAH GCNSIIATKT ALKLADYVVT EAGFGADLGA EKFYDVKCRY AGFKPDATVI VATVRALKMH GGVPKSDLAT ENLEALREGF ANLEKHIENI GKFGVPAVVA INAFPTDTEA ELNLLYELCA KAGAEVALSE VWAKGGEGGL ELARKVLQTL ESRPSNFHVL YNLDLSIKDK IAKIATEIYG ADGVNYTAEA DKAIQRYESL GYGNLPVVMA KTQYSFSDDM TKLGRPRNFT ITVREVRLSA GAGFIVPITG AIMTMPGLPK RPAACNIDID ADGVITGLF
|
| |
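The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is a minimal illustration, not part of the database's own tooling. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon (the gene sequence above ends in TAG). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C of a sequence; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Values taken from the record for Moth_0109.
length = gene_length(106977, 108656)
print(length)                  # 1680
print(protein_length(length))  # 559
```

Both results agree with the Gene Length and Protein Length fields. Passing the Gene sequence field to `gc_percent` gives a value that can be compared with the reported GC content in the same way. Note that the record does not say whether that figure refers to the gene or to the whole replicon.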