Gene Information |
Locus tag | Emin_1052 |
Symbol | |
ID | 6263906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | - |
Start bp | 1145050 |
End bp | 1146720 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 642611532 |
Product | formate--tetrahydrofolate ligase |
Protein accession | YP_001875942 |
Protein GI | 187251460 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG2759] Formyltetrahydrofolate synthetase |
TIGRFAM ID | |
|
|
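The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information rows above.
START_BP = 1145050
END_BP = 1146720
GENE_LENGTH_BP = 1671
PROTEIN_LENGTH_AA = 556


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1671
    print(expected_protein_length(GENE_LENGTH_BP))  # 556
```

Under these assumptions both derived values agree with the Gene Length and Protein Length rows.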
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 86 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAGCG ATATCGAAAT TGCGCAGCGC GCTAAAGTAT GGCCTATCGC CAAAGTGGCC GTAAAATTGG GTATAAAAAA ATCCCAAATA GAACTTTACG GACACTATAA GGCAAAACTT TCTTTTGACT GCATAAAAAA ATTGCAAAAG AAACCTGACG GCAACCTTAT TTTAGTTACG GCCATTTCAC CAACTGCCGC GGGTGAAGGT AAATCAACCA CAACCGTGGG GCTGGCGCAG GCTTTGGCTA AGATAGGTAA AAAAGCCATT GTCGCGCTGC GCGAACCTTC TTTAGGGCCG TGTATGGGCA TTAAAGGCGG CGCGGCGGGG GGCGGATATT CCCAGGTTGT TCCTATGGAG GATATTAACC TTCATTTCAC GGGGGATATG CACGCGATCA CAGCCGCTAA TAATTTGTTA TCAGCTATTA TTGACAATCA CATACACCAG GGTAATGAAC TGGGTATAGA CGAAAGACGC ATAGTATGGC ACCGTGTTGT TGATATCAAT GACCGCGCTT TAAGAAACAT AGTTGTCGCT TTAGGCGGCA AAGGTAACGG GTTTCCCAGG GAAGACAGTT TTGATATAAC CGTAGCTTCT GAAGTTATGG CTATTTTGTG TCTCTCCGAA AGTTTGGCCG ACCTTAAAAA AAGACTTTCT AAAGTTATAG TCGGGTATAA TTTCGCGGAT AAACCCGTTA CCGCCGGCAT GCTTAAAGCG GAAGGCGCTA TGGCCGCCTT ACTTAAAGAC GCCATTAAAC CTAACCTTGT GCAAACTTTA GAAAACGTAC CCGCCATTAT ACACGGCGGT CCTTTTGCCA ATATCGCGCA TGGATGCAAC AGCGTTATAG CAACAAAAAC CGCTTTAAAA CTTGCCGACT ATATTGTTAC GGAGGCGGGT TTTGGCGCTG ATTTAGGCGC CGAGAAATTT TTTAACATAA AATGCCGCTA CGCGGGACTT ACACCCAAAG TAGCGATTAT TGTAGCCACT GTGCGCGCGC TTAAAATGCA CGGCGGCGTA AGCAAAGATA AATTAACCCA TCTTGATAAA CAGGCAGTAA TACGCGGGCT TGTTAATTTA GATAAACATA TTGAAAACGT TAAAAAATTC GGCGTGCCGC CTGTTGTGGC CATAAATATT TTCAGCGGCG ATTCCAAAGA GGAAATCGCC GCCGTAAAAG CGCATTGCAA AAAAATAGGC GTGCCTGTTG AGCTTTCGGA CGTGTTTGCC AAGGGCGGCG AGGGCGGTAT CCAGCTTGCT AAAAAAGTTG TGGATATTAT TTCAAAAAAC AAAAGCAAAT TTCGGTTTAC TTATGAATCG GAAGACAGTT TAGAAGAAAA AACAAAAAAA ATAGTAAAAA ATATTTACGG AGCCAAAGAC GTGTTTTTTG ATAAAAAAGC TTTAGACTCA ATAAAGAAAT ACGAGGCTAT GGGCTTTGGC AATATCCCGG TTTGTATGGC TAAAACCCAG TATTCTTTTT CGGATAATCC AAAACTTTAC GGAAGGCCCG AAGGCTTTAC CATTGAAGTT CGTGAAGCCA GGATTTCCGC CGGAGCGGGC TTTGTCGTTA TGTTAACGGG TAATATTATG ACAATGCCGG GGCTTCCAAA GTTCCCCGCG GCTGAAAAAA TTGATATTTC ATCCGAGGGC GTTATAAAAG GTTTATCATA A
|
Protein sequence | MLSDIEIAQR AKVWPIAKVA VKLGIKKSQI ELYGHYKAKL SFDCIKKLQK KPDGNLILVT AISPTAAGEG KSTTTVGLAQ ALAKIGKKAI VALREPSLGP CMGIKGGAAG GGYSQVVPME DINLHFTGDM HAITAANNLL SAIIDNHIHQ GNELGIDERR IVWHRVVDIN DRALRNIVVA LGGKGNGFPR EDSFDITVAS EVMAILCLSE SLADLKKRLS KVIVGYNFAD KPVTAGMLKA EGAMAALLKD AIKPNLVQTL ENVPAIIHGG PFANIAHGCN SVIATKTALK LADYIVTEAG FGADLGAEKF FNIKCRYAGL TPKVAIIVAT VRALKMHGGV SKDKLTHLDK QAVIRGLVNL DKHIENVKKF GVPPVVAINI FSGDSKEEIA AVKAHCKKIG VPVELSDVFA KGGEGGIQLA KKVVDIISKN KSKFRFTYES EDSLEEKTKK IVKNIYGAKD VFFDKKALDS IKKYEAMGFG NIPVCMAKTQ YSFSDNPKLY GRPEGFTIEV REARISAGAG FVVMLTGNIM TMPGLPKFPA AEKIDISSEG VIKGLS
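The sequence fields are printed in space-separated blocks of ten, so they need to be joined before any computation. The following is a minimal sketch of how the GC content and the translation could be recomputed from the Gene sequence field. It assumes the gene sequence is already given in coding orientation (it begins with ATG), and it uses the amino acid assignments of translation table 11, which match those of the standard code. The helper names `clean`, `gc_content` and `translate` are hypothetical and not part of the source database.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated with the bases in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(field: str) -> str:
    """Join the space-separated blocks of a sequence field into one string."""
    return "".join(field.split()).upper()


def gc_content(field: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(field)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(field: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(field)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the Gene sequence field above.
    print(translate("ATGTTAAGCG ATATCGAAAT T"))  # MLSDIEI
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field would check the two fields against each other. Multiplying `gc_content` by 100 gives a percentage comparable to the GC content row.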
|
| |