Gene Moth_0533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0533 
Symbol 
ID3830918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp556475 
End bp557797 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content58% 
IMG OID637828474 
ProductFolC bifunctional protein 
Protein accessionYP_429406 
Protein GI83589397 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.861672 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACTACC AGGAAGCTCT TGAATTTTTG CGGCAACTAA CCAAGTTTGG TTTTAACCTG 
GGCCTGGGGC GAATTAAGGA ACTCATGCGC CGGGTCGGTT CGCCCCAGGA GCGCCTGCGC
TTTATCCATA TCGGTGGGAC CAACGGCAAG GGCTCGGTCT CGGCCATGGT GGCCAGTATC
CTTAAGGCGG CTGGCTACCG GGTGGGACTA TTTACCTCTC CCCACCTTCA TTCCTATACC
GAACGGATCC AGATTAACGG TCAGAATATC CCTGAGGAAC GCCTGGCAGC CCTCCTTACC
TGGTTTAAAC CACTTTTGAC GGCTATGGTA GCCGACGGTT ATGAACACCC GACGGAATTC
GAAGTCGGCA CTGCCGTAGC CCTGAAGTAC TTTGCCGACG AAGGGGTAGA CCTGGTAGTT
CTGGAAGTAG GACTGGGTGG GGCCATTGAT TCTACTAATG TGATTAATAC TTCACTGGTG
AGCGTTATAA CAAATGTCGG CATGGACCAC ATGCAGTATT TGGGGAACAC TATAGCTGAG
ATCGCCCGGG TCAAGGCCGG TATTATCAAA CCCGGGGGGA TTGTGGTTAC TGCCTCGCGT
TTGCCGGAAG CCCTGGAAGT TATCAGCACC ACCTGCCGGG AAAAGGGGGC TACCCTTTAC
CAGGTAGGCC GGGATGTCAC CTGGCGGGAA CGACGGGTAT CCCTGGCGGG AGGCGAATTT
GATTGCCGGG GTTTGCTGGC TACCTATGAA GGCCTCAAGG TTCACCTCCT GGGGCGCCAC
CAGCTGGAGA ATGCGGCCAC GGCGGTGGCG GTAATCGAAG CAGCCGTCCG CCACCACGGG
CTAAAGGTAA CACCGGACCA CCTGCGCCAG GGCCTGGCCG CCGCCACCTG GCCGGCGCGC
CTGGAAATCA TACAACGGGA ACCAATGGTC ATCATTGACG GCGCCCACAA CTTTGATGGG
GCCGTAAGTC TACGCCTGGC CCTGGAAGAG ATTTTCCGCT ATCGCCGCCT GATCCTGGTC
CTGGGAATGC TGGCTGATAA AGAGCGGGAG AAAGTGGTGG CCGTACTGGC CCCCTTGGCG
GCGGCCGTGA TTGTCACCCG GCCCAATAAC CCCCGGGCCG GGAACTGGCA GTCCCTGGCC
GACAGCGTGA GACGTTATGT CGGCGTGGTG GAGGTTATCG AAGCCATTCC TGCGGCAGTC
GAGAGGGCCC TGGCCCTTGC CGAGCCCTCC GACCTCATCT GTGTTACCGG TTCCTTGTAC
ATGGTGGCAG ACGCCAGGGA ATGGCTGAAG AAGTTTAAAA AAGAGGAATC TCCAGGGGGG
TAG
 
Protein sequence
MNYQEALEFL RQLTKFGFNL GLGRIKELMR RVGSPQERLR FIHIGGTNGK GSVSAMVASI 
LKAAGYRVGL FTSPHLHSYT ERIQINGQNI PEERLAALLT WFKPLLTAMV ADGYEHPTEF
EVGTAVALKY FADEGVDLVV LEVGLGGAID STNVINTSLV SVITNVGMDH MQYLGNTIAE
IARVKAGIIK PGGIVVTASR LPEALEVIST TCREKGATLY QVGRDVTWRE RRVSLAGGEF
DCRGLLATYE GLKVHLLGRH QLENAATAVA VIEAAVRHHG LKVTPDHLRQ GLAAATWPAR
LEIIQREPMV IIDGAHNFDG AVSLRLALEE IFRYRRLILV LGMLADKERE KVVAVLAPLA
AAVIVTRPNN PRAGNWQSLA DSVRRYVGVV EVIEAIPAAV ERALALAEPS DLICVTGSLY
MVADAREWLK KFKKEESPGG