Gene GWCH70_2572 details


Gene Information

Locus tag: GWCH70_2572
Symbol:
ID: 7976335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2599611
End bp: 2600906
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 43%
IMG OID: 644799373
Product: FolC bifunctional protein
Protein accession: YP_002950533
Protein GI: 239827909
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
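
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record; the coordinate convention is an assumption.
START_BP = 2599611
END_BP = 2600906


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1296
print(protein_length(length))  # 431
```

Under these assumptions both results match the reported 1296 bp and 431 aa.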


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGATTC GTACATATGA AGAGGCGCTA GAGTGGATTC ACGGCCGTTT GCGATTAGGA 
ATAAAACCAG GGCTAAAACG AATGGAATGG TTGATGGACA AGCTCGGACA TCCGGAACGC
CGCATCAGGG CGATTCATAT CGCGGGAACA AACGGAAAAG GATCAACGGT AAGCTATTTG
CGCCATATTT TACAAGCGGC AGGTTACTCT GTCGGAACGT TCACTTCCCC GTATGTGGAG
CAATTTAATG AACGAATAAG CGTAAATGGA AACCCAATTC GTGACAAGGA AATCGTTGAA
CTCGTGCAAG TAATCCAACC GCTTGCAGAA GAATTGGAAA AAACAGAACT TGGCGCGCCA
ACGGAATTTG AAGTCATTAC GGCGATGATG CTTTATTATT TCGGGAAAAA GAATATTCAA
GACGTTGTGC TTATTGAAGC CGGACTTGGC GGCCGCTTTG ATTCGACGAA CGTCATTTAT
CCGCTTCTTT CCATTATCAC CAACATCAGC TATGACCATA TGAATTTTTT AGGAGAAACA
TTAGAAAAAA TTGCCTTTGA AAAAGCGGGC ATCATTAAAT CAGGCGTTCC GGTGATTACA
GCGGTAAACC AGCCAGAAGC ATGGACGGTT ATTTCAGAAA AAGCGAAGTC GCTCAAAGCG
AAAACATATC GATTAGAAGA AGACTTTTTT ATTGTTCATC ATGAGTCAAC AGAGGATGGC
GAACATTTTT CAATGGAAAC AATATTTTCG CAATATCCCG ATTTAAAAAT AATGATGTTT
GGAGAACATC AAGTGCAAAA TGCGGCGCTC GCGGTGATGG CAGCGGAGTA TTTGCGGATG
TGCTATTCGT TTTTGATTGA AAAAGAGCAT ATTTATGAGG GGATAGAGAA AGCGAAGTGG
ATCGGACGAT TTGAGCGGGT AAGCAATAAG CCGCTGATTA TTATTGACGG CGCTCATAAC
GCGGCAGGTA TTCATAGTCT TGTTAATACG GTACGTTCGC ACTATCCAAA CAAAGATGTT
CATGTGTTAT TTGCTGCGTT AGGGGATAAA CCGCTTGATC AAATGATTCC GCCGCTTGCC
GAAATTGCCA AAACGATAAC GTTTACTTCG TTCGATTTTC CGCGCGCAGC GTCGCCTGAG
CAGCTTGCCG CCTTGTGCAA CCATCCGCAC AAAGCAACGA CGGACGACTG GAAAGGCTGG
GTCAAAGAAA AGAAAAAACA AAAACGAAGC GATGACCTTT TCTTGATTAC CGGATCGCTT
TACTTTATTG CGGAAGTAAG AAAGTTATTG ATGTAA
 
Protein sequence
MMIRTYEEAL EWIHGRLRLG IKPGLKRMEW LMDKLGHPER RIRAIHIAGT NGKGSTVSYL 
RHILQAAGYS VGTFTSPYVE QFNERISVNG NPIRDKEIVE LVQVIQPLAE ELEKTELGAP
TEFEVITAMM LYYFGKKNIQ DVVLIEAGLG GRFDSTNVIY PLLSIITNIS YDHMNFLGET
LEKIAFEKAG IIKSGVPVIT AVNQPEAWTV ISEKAKSLKA KTYRLEEDFF IVHHESTEDG
EHFSMETIFS QYPDLKIMMF GEHQVQNAAL AVMAAEYLRM CYSFLIEKEH IYEGIEKAKW
IGRFERVSNK PLIIIDGAHN AAGIHSLVNT VRSHYPNKDV HVLFAALGDK PLDQMIPPLA
EIAKTITFTS FDFPRAASPE QLAALCNHPH KATTDDWKGW VKEKKKQKRS DDLFLITGSL
YFIAEVRKLL M
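
The GC content field can be recomputed from the gene sequence once the 10-base display spacing is stripped. The following is a minimal sketch, not the method the database itself uses; the helper names are hypothetical, and the sample input is only the first displayed line of the gene sequence, so its result will not necessarily equal the gene-wide figure.

```python
def clean_sequence(blocked: str) -> str:
    """Drop the spaces and line breaks used to display the sequence in 10-base groups."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, used here only as sample input;
# paste the full 1296 bp sequence to compare against the reported 43%.
sample = "ATGATGATTC GTACATATGA AGAGGCGCTA GAGTGGATTC ACGGCCGTTT GCGATTAGGA"
seq = clean_sequence(sample)
print(len(seq), round(gc_percent(seq)))
```

Rounding to the nearest integer mirrors how the record reports the value as a whole percentage.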