Gene Athe_2208 details


Gene Information

Locus tag: Athe_2208
Symbol:
ID: 7408404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2338858
End bp: 2340516
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 36%
IMG OID: 643716575
Product: Formate--tetrahydrofolate ligase
Protein accession: YP_002574055
Protein GI: 222530173
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
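
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2338858
end_bp = 2340516

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and codes for no residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the script reproduces the listed gene length (1659 bp) and protein length (552 aa).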


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000668109
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAGCA TCTCAAAAGT AGAAGAAGTA CTCGAACCCA TATCCAAGAT TGCAGAAAAA 
ATAGGACTTG ACGAAGATGA AATTGAGCTT TATGGAAAAT ACAAGGCAAA GATAAGTTTG
GATATTCTTA AGAAAAAAGC ACAATTACAA GAGGGCAAGG TTATTTTAGT GACATCTATT
AACCCAACAC CTTTTGGAGA GGGGAAGACG ACAACTGCGA TTGGTCTTTC TATGGCAATA
AACAGGCTGG GGTTTAAATC TATCGTTACT TTAAGAGAAC CTTCCTTAGG ACCGTTTTTG
GGTTTAAAAG GTGGGGCAAC AGGTGGCGGC GCTTCTCAGA TTTTGCCCTC AATTGATATA
AATCTTCACT TTACAGGAGA CATTCATGCA GTGACCTCTG CAAACAATCT TCTTTGCGCT
GCTGTTGACA ACCACATTTA TCATGGAAAT AGACTTGGAA TAAATCCAAA GTCTATAACC
ATAAAAAGAG CAATGGATAT GAATGATAGA AGTCTTCGGC ACATTATAGT TGGACTTTCA
AATGACCAGA AAGGTGCTAT AAGAGAAGAT GGGTTTGTTA TCTCTGTTGC CTCTGAAGTG
ATGGCAGTTT TGTGTCTTTC AATGAGCTAT GACGATCTAA AAGAAAAACT TGGAAATATA
TTAGTAGGTT TTACCTATGA CAAAAAACCT GTGTATGCCA AGGATTTGAA TGTCCATGGG
AGTATGGCTC TTTTATTAAA AGATGCACTA AAACCAAACC TTGTTCAAAC TTCTGAAAAT
ACCGCTGCAA TTGTTCATGG TGGTCCTTTT GCAAATATTG CACACGGGAC AAATAGCATT
GTTGCAACAA AAATTGCTCA AAAACTTTCT GAATATGTAG TTGTTGAGGC AGGTTTTGGG
TCGGATTTAG GAGCAGAGAA GTTTATAAAT ATTGTTGCAA GAAAATCTGG AATATATCCA
CAAGCTGCTG TTCTTGTTGT GACAGTTAAA GCATTAAAAC ATCATGCGAA GATTGAAGAA
AATAGTGGTT TACAAAGTGG TGTAAATTCT ATTCAACAAG GACTTGAGAA TTTAGAAAAA
CACATTGAAA ATCTCAAAGT CATGGGGCTT GAGACAGTGG TGGCTTTAAA TAAGTTTCCG
GACGATAAAG ATGAAGAGAT TGAGCTTATC AGGTCTTTTT GTGAGGAAAT GGGTGTAGAA
TTTTCAGTAT CAAGTGCATA TACTCACGGG TCAGAAGGTG TGCTTGAGCT TGCTGAAAAG
GTTATAAGGT TGAGCGATAA AAGAAAAAGA ATAAACTTTG TTTACCAAGA CAGTGATTTT
ATCGAGGAGA AAATTAAAAA AGTTGCAACC ATCATCTATG GCGCAAAAGA TGTAAAGTTT
TCTAAAGCAG CTTTGTCAAA ACTTGAACTT ATAAAAAACC TCAAGGTTGA ACATTTTCCC
ATTTGTATGT CAAAAACTCA GTATTCGCTT TCTGATGACC CGAAATTACT TGGAAAACCA
AAAGATTTTA TATTAAATGT TACAGACATA GAAATAAAAA ATGGGGCTGG ATTTATAGTT
GTCATGTGCG GTGATATAAT TGCAATGCCA GGGCTTGGAA AAGACTTTGC AGCTCTTCAT
CTTGACATCG ACAGTAGCGG AAATCCCATT TTTAAATAA
 
Protein sequence
MKSISKVEEV LEPISKIAEK IGLDEDEIEL YGKYKAKISL DILKKKAQLQ EGKVILVTSI 
NPTPFGEGKT TTAIGLSMAI NRLGFKSIVT LREPSLGPFL GLKGGATGGG ASQILPSIDI
NLHFTGDIHA VTSANNLLCA AVDNHIYHGN RLGINPKSIT IKRAMDMNDR SLRHIIVGLS
NDQKGAIRED GFVISVASEV MAVLCLSMSY DDLKEKLGNI LVGFTYDKKP VYAKDLNVHG
SMALLLKDAL KPNLVQTSEN TAAIVHGGPF ANIAHGTNSI VATKIAQKLS EYVVVEAGFG
SDLGAEKFIN IVARKSGIYP QAAVLVVTVK ALKHHAKIEE NSGLQSGVNS IQQGLENLEK
HIENLKVMGL ETVVALNKFP DDKDEEIELI RSFCEEMGVE FSVSSAYTHG SEGVLELAEK
VIRLSDKRKR INFVYQDSDF IEEKIKKVAT IIYGAKDVKF SKAALSKLEL IKNLKVEHFP
ICMSKTQYSL SDDPKLLGKP KDFILNVTDI EIKNGAGFIV VMCGDIIAMP GLGKDFAALH
LDIDSSGNPI FK
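
The gene and protein sequences above can be spot-checked against each other. The sketch below is a hedged illustration, not the database's own method: it builds the codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11 (the two tables differ in permitted start codons, not in amino acid assignments) and translates only the first three blocks and the last block of the gene sequence. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 0; spaces between blocks are ignored."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First three 10-base blocks and the final 9-base block of the gene sequence.
first_block = translate("ATGAAGAGCA TCTCAAAAGT AGAAGAAGTA")
last_block = translate("TTTAAATAA")

print(first_block)  # MKSISKVEEV
print(last_block)   # FK*
```

The two fragments match the start (MKSISKVEEV) and end (FK, followed by the stop codon) of the listed protein sequence. The last block can be translated on its own because the 1650 bases before it are a whole number of codons.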