Gene Moth_0109 details


Gene Information

Locus tag: Moth_0109
Symbol:
ID: 3831999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 106977
End bp: 108656
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 58%
IMG OID: 637828043
Product: formate--tetrahydrofolate ligase
Protein accession: YP_428991
Protein GI: 83588982
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
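
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 106977
END_BP = 108656
GENE_LENGTH_BP = 1680
PROTEIN_LENGTH_AA = 559


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1680
print(expected_protein_length(GENE_LENGTH_BP))  # 559
```

Under those assumptions both derived values agree with the Gene Length and Protein Length fields.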


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCAAGG TACCCAGTGA TATTGAGATT GCCCAGGCAG CCAAAATGAA ACCGGTCATG 
GAACTGGCCC GGGGACTGGG CATCCAAGAG GACGAGGTCG AGCTTTATGG TAAGTACAAG
GCCAAGATCT CCCTCGATGT CTATCGTCGC CTCAAAGACA AGCCTGACGG GAAACTAATC
CTGGTAACCG CCATTACCCC TACTCCGGCC GGCGAAGGGA AAACTACTAC CAGTGTCGGT
CTCACCGATG CCCTGGCTCG CCTGGGGAAA AGGGTGATGG TCTGCCTGCG GGAGCCCTCC
CTGGGACCCA GCTTTGGTAT CAAAGGCGGT GCCGCCGGCG GTGGTTATGC CCAGGTAGTA
CCCATGGAAG ATATCAACCT GCACTTCACC GGCGATATCC ACGCCGTCAC CTATGCCCAC
AACCTGCTGG CGGCCATGGT GGATAACCAC CTGCAGCAGG GTAACGTCCT GAATATTGAT
CCCCGTACCA TCACCTGGCG CCGGGTCATC GACCTTAATG ACCGGGCTCT GAGGAACATA
GTCATCGGCC TGGGTGGCAA AGCCAACGGC GTACCGCGGG AGACAGGGTT TGACATCTCC
GTTGCCTCGG AGGTTATGGC CTGCCTGTGC CTGGCCAGCG ACCTCATGGA TCTCAAGGAA
CGTTTCAGCC GCATTGTTGT CGGCTACACC TATGACGGCA AACCGGTCAC CGCCGGCGAT
CTGGAGGCCC AGGGTTCCAT GGCTCTTCTC ATGAAGGACG CCATTAAACC CAACCTGGTC
CAAACCCTGG AGAATACGCC GGCCTTTATC CACGGTGGTC CCTTCGCCAA TATCGCCCAC
GGTTGCAACA GCATTATCGC AACCAAGACG GCCCTGAAAC TGGCGGATTA TGTCGTGACG
GAAGCCGGTT TCGGTGCCGA CCTGGGTGCC GAGAAGTTCT ATGACGTTAA ATGCCGTTAT
GCCGGCTTTA AACCCGATGC CACAGTCATC GTGGCTACCG TCCGCGCCCT CAAGATGCAC
GGCGGCGTAC CCAAATCAGA CCTGGCCACT GAAAACCTGG AAGCCCTGCG GGAAGGCTTT
GCCAACCTGG AGAAACACAT CGAAAATATC GGCAAGTTCG GCGTACCGGC AGTCGTGGCC
ATCAATGCCT TCCCCACCGA TACCGAGGCC GAGCTAAATC TCCTCTACGA GTTGTGCGCC
AAAGCTGGGG CCGAAGTTGC CCTCTCGGAA GTCTGGGCTA AGGGCGGCGA AGGCGGTCTG
GAACTTGCCC GGAAGGTGTT GCAGACCCTG GAGAGCAGGC CATCCAACTT CCATGTCCTC
TACAACCTGG ACCTGAGTAT TAAAGACAAA ATTGCCAAAA TCGCCACCGA GATCTACGGG
GCCGACGGCG TCAACTATAC GGCCGAAGCC GACAAAGCTA TCCAGCGTTA TGAATCCCTG
GGCTACGGCA ACCTGCCGGT GGTCATGGCC AAGACCCAAT ACTCCTTTTC CGATGACATG
ACCAAGCTCG GGCGGCCGCG GAACTTTACC ATCACCGTGC GCGAGGTGCG CCTCTCGGCC
GGAGCAGGCT TTATCGTCCC CATCACCGGC GCCATAATGA CCATGCCCGG GCTGCCCAAA
CGCCCGGCGG CCTGCAACAT CGACATCGAT GCCGACGGCG TCATTACCGG TCTTTTCTAG
 
Protein sequence
MSKVPSDIEI AQAAKMKPVM ELARGLGIQE DEVELYGKYK AKISLDVYRR LKDKPDGKLI 
LVTAITPTPA GEGKTTTSVG LTDALARLGK RVMVCLREPS LGPSFGIKGG AAGGGYAQVV
PMEDINLHFT GDIHAVTYAH NLLAAMVDNH LQQGNVLNID PRTITWRRVI DLNDRALRNI
VIGLGGKANG VPRETGFDIS VASEVMACLC LASDLMDLKE RFSRIVVGYT YDGKPVTAGD
LEAQGSMALL MKDAIKPNLV QTLENTPAFI HGGPFANIAH GCNSIIATKT ALKLADYVVT
EAGFGADLGA EKFYDVKCRY AGFKPDATVI VATVRALKMH GGVPKSDLAT ENLEALREGF
ANLEKHIENI GKFGVPAVVA INAFPTDTEA ELNLLYELCA KAGAEVALSE VWAKGGEGGL
ELARKVLQTL ESRPSNFHVL YNLDLSIKDK IAKIATEIYG ADGVNYTAEA DKAIQRYESL
GYGNLPVVMA KTQYSFSDDM TKLGRPRNFT ITVREVRLSA GAGFIVPITG AIMTMPGLPK
RPAACNIDID ADGVITGLF
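
The relationship between the gene sequence and the protein sequence can be checked by translating the CDS with translation table 11. The following is a minimal sketch, not the tool that produced this record: it builds the codon table from the standard NCBI amino-acid string (table 11 shares its codon assignments with the standard code), reads the first codon as methionine on the assumption that it is a valid initiator (this gene starts with TTG), and drops a terminal stop. The helper names `clean`, `translate_cds` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(dna):
    """Translate a complete CDS; the first codon is read as Met (assumption)."""
    dna = clean(dna)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 in length")
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    protein[0] = "M"          # assumed initiator codon, e.g. TTG in this gene
    if protein[-1] == "*":    # drop the terminal stop codon if present
        protein.pop()
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_line = "TTGTCCAAGG TACCCAGTGA TATTGAGATT GCCCAGGCAG CCAAAATGAA ACCGGTCATG"
print(translate_cds(first_line))  # MSKVPSDIEIAQAAKMKPVM
```

The printed peptide matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.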