Gene Emin_1052 details


Gene Information

Locus tag: Emin_1052
Symbol:
ID: 6263906
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1145050
End bp: 1146720
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 44%
IMG OID: 642611532
Product: formate--tetrahydrofolate ligase
Protein accession: YP_001875942
Protein GI: 187251460
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
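
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1145050, 1146720)
print(length)                           # 1671
print(expected_protein_length(length))  # 556
```

Both values agree with the Gene Length (1671 bp) and Protein Length (556 aa) fields of this record.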


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 86
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAGCG ATATCGAAAT TGCGCAGCGC GCTAAAGTAT GGCCTATCGC CAAAGTGGCC 
GTAAAATTGG GTATAAAAAA ATCCCAAATA GAACTTTACG GACACTATAA GGCAAAACTT
TCTTTTGACT GCATAAAAAA ATTGCAAAAG AAACCTGACG GCAACCTTAT TTTAGTTACG
GCCATTTCAC CAACTGCCGC GGGTGAAGGT AAATCAACCA CAACCGTGGG GCTGGCGCAG
GCTTTGGCTA AGATAGGTAA AAAAGCCATT GTCGCGCTGC GCGAACCTTC TTTAGGGCCG
TGTATGGGCA TTAAAGGCGG CGCGGCGGGG GGCGGATATT CCCAGGTTGT TCCTATGGAG
GATATTAACC TTCATTTCAC GGGGGATATG CACGCGATCA CAGCCGCTAA TAATTTGTTA
TCAGCTATTA TTGACAATCA CATACACCAG GGTAATGAAC TGGGTATAGA CGAAAGACGC
ATAGTATGGC ACCGTGTTGT TGATATCAAT GACCGCGCTT TAAGAAACAT AGTTGTCGCT
TTAGGCGGCA AAGGTAACGG GTTTCCCAGG GAAGACAGTT TTGATATAAC CGTAGCTTCT
GAAGTTATGG CTATTTTGTG TCTCTCCGAA AGTTTGGCCG ACCTTAAAAA AAGACTTTCT
AAAGTTATAG TCGGGTATAA TTTCGCGGAT AAACCCGTTA CCGCCGGCAT GCTTAAAGCG
GAAGGCGCTA TGGCCGCCTT ACTTAAAGAC GCCATTAAAC CTAACCTTGT GCAAACTTTA
GAAAACGTAC CCGCCATTAT ACACGGCGGT CCTTTTGCCA ATATCGCGCA TGGATGCAAC
AGCGTTATAG CAACAAAAAC CGCTTTAAAA CTTGCCGACT ATATTGTTAC GGAGGCGGGT
TTTGGCGCTG ATTTAGGCGC CGAGAAATTT TTTAACATAA AATGCCGCTA CGCGGGACTT
ACACCCAAAG TAGCGATTAT TGTAGCCACT GTGCGCGCGC TTAAAATGCA CGGCGGCGTA
AGCAAAGATA AATTAACCCA TCTTGATAAA CAGGCAGTAA TACGCGGGCT TGTTAATTTA
GATAAACATA TTGAAAACGT TAAAAAATTC GGCGTGCCGC CTGTTGTGGC CATAAATATT
TTCAGCGGCG ATTCCAAAGA GGAAATCGCC GCCGTAAAAG CGCATTGCAA AAAAATAGGC
GTGCCTGTTG AGCTTTCGGA CGTGTTTGCC AAGGGCGGCG AGGGCGGTAT CCAGCTTGCT
AAAAAAGTTG TGGATATTAT TTCAAAAAAC AAAAGCAAAT TTCGGTTTAC TTATGAATCG
GAAGACAGTT TAGAAGAAAA AACAAAAAAA ATAGTAAAAA ATATTTACGG AGCCAAAGAC
GTGTTTTTTG ATAAAAAAGC TTTAGACTCA ATAAAGAAAT ACGAGGCTAT GGGCTTTGGC
AATATCCCGG TTTGTATGGC TAAAACCCAG TATTCTTTTT CGGATAATCC AAAACTTTAC
GGAAGGCCCG AAGGCTTTAC CATTGAAGTT CGTGAAGCCA GGATTTCCGC CGGAGCGGGC
TTTGTCGTTA TGTTAACGGG TAATATTATG ACAATGCCGG GGCTTCCAAA GTTCCCCGCG
GCTGAAAAAA TTGATATTTC ATCCGAGGGC GTTATAAAAG GTTTATCATA A
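
The gene sequence is listed in blocks of 10 bases separated by spaces, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage for comparison with the 44% reported in the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing: six blocks of 10 bases.
first_line = "ATGTTAAGCG ATATCGAAAT TGCGCAGCGC GCTAAAGTAT GGCCTATCGC CAAAGTGGCC"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(seq[:3])          # ATG
print(gc_percent(seq))  # GC percentage of this 60-base fragment only
```

Passing the complete listing through `clean_sequence` and `gc_percent` would give a figure to compare against the GC content field.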
 
Protein sequence
MLSDIEIAQR AKVWPIAKVA VKLGIKKSQI ELYGHYKAKL SFDCIKKLQK KPDGNLILVT 
AISPTAAGEG KSTTTVGLAQ ALAKIGKKAI VALREPSLGP CMGIKGGAAG GGYSQVVPME
DINLHFTGDM HAITAANNLL SAIIDNHIHQ GNELGIDERR IVWHRVVDIN DRALRNIVVA
LGGKGNGFPR EDSFDITVAS EVMAILCLSE SLADLKKRLS KVIVGYNFAD KPVTAGMLKA
EGAMAALLKD AIKPNLVQTL ENVPAIIHGG PFANIAHGCN SVIATKTALK LADYIVTEAG
FGADLGAEKF FNIKCRYAGL TPKVAIIVAT VRALKMHGGV SKDKLTHLDK QAVIRGLVNL
DKHIENVKKF GVPPVVAINI FSGDSKEEIA AVKAHCKKIG VPVELSDVFA KGGEGGIQLA
KKVVDIISKN KSKFRFTYES EDSLEEKTKK IVKNIYGAKD VFFDKKALDS IKKYEAMGFG
NIPVCMAKTQ YSFSDNPKLY GRPEGFTIEV REARISAGAG FVVMLTGNIM TMPGLPKFPA
AEKIDISSEG VIKGLS
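
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch using the codon assignments of the standard genetic code, which translation table 11 shares for elongation codons. It does not handle the alternative start codons of table 11 (for example GTG or TTG read as M at the initiation position), which is not an issue here because this gene starts with ATG. The `translate` function is a hypothetical helper, not a database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a (possibly space-blocked) DNA string, codon by codon."""
    seq = "".join(blocked_dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 12 bases of the gene sequence listing.
print(translate("ATGTTAAGCG ATATCGAAAT TGCGCAGCGC"))  # MLSDIEIAQR
print(translate("GGTTTATCAT AA"))                     # GLS*
```

The two fragments match the first ten residues (MLSDIEIAQR) and the final residues (GLS) of the protein sequence, with the trailing `*` corresponding to the TAA stop codon.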