Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2358 |
Symbol | |
ID | 4076477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2478640 |
End bp | 2480316 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638007680 |
Product | formate--tetrahydrofolate ligase |
Protein accession | YP_614352 |
Protein GI | 99082198 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG2759] Formyltetrahydrofolate synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.269795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.879524 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTATA AGAGTGACAT TGAGATCGCC CGCGCGGCGA ACAAGCTCCC GATCCAGGAG ATCGGCGCCA AACTGGGCAT GAAAAACGAC GACCTGCTGC CCTATGGCCA CGACAAGGCG AAGGTGAGCC AAGAGTTCAT CAACTCCGTG CAGGGCAATG AAGACGGCAA GCTGGTGCTG GTGACCGCGA TCAACCCGAC CCCGGCGGGT GAGGGCAAGA CCACCACCAC CGTTGGTCTG GGCGATGGTC TGAACCGTAT CGGCAAAAAT GCGATGATCT GTATCCGCGA AGCGTCCCTC GGGCCGAACT TCGGCATGAA GGGTGGCGCC GCCGGTGGCG GCATGGCGCA GGTTGTGCCG ATGGAGGAAA TGAACCTCCA CTTCACAGGC GACTTCCATG CGATCACCTC GGCGCACTCG TTGCTGTCGG CCATGATCGA CAACCACATC TACTGGGGCA ACGAGCAGGA AATCGACATC CGCCGCGTTG CATGGCGTCG TGTGGTCGAC ATGAACGACC GCGCCCTGCG CCAGATCACG GCGTCGCTGG GTGGGGTGTC CAACGGCTTC CCGCGCGAAA CCGGTTTTGA CATCACCGTG GCTTCCGAGG TCATGGCGAT CCTCTGCCTT GCCAACGACC TGAAGGACCT GGAAAAGCGT CTCGGCGACA TCATCGTGGC CTATCGCCGC GACAAGACCC CGGTTTACTG CCGCGACATC AAGGCCGAAG GCGCGATGAC CGTTCTCCTG AAGGACGCCA TGCAGCCCAA CCTCGTTCAG ACCCTGGAAA ACAACCCGGC GTTTGTACAC GGTGGCCCGT TTGCGAACAT CGCACATGGC TGTAACTCCG TGATCGCAAC CAAGACCGCG CTCAAAGTGG CGGATTACGT TGTCACAGAA GCAGGCTTTG GGGCCGACCT CGGCGCCGAG AAGTTCATGA ACATCAAATG CCGCAAGGCC GGCATCGCCC CCTCTGCGGT GGTGGTCGTT GCGACCGTGC GCGCCATGAA GATGAACGGT GGCGTGGCCA AGGCGGATCT GGGCGCGGAA AACGTCGAGG CGGTCAAGAA CGGCTGTGCA AACCTCGGTC GTCACATCGA GAACGTCAAA TCCTTTGGCG TGCCTGCGGT CGTGGCGATC AACCACTTCG TCACCGACAC CGATGCGGAG ATCAATGCGG TCAAGGAATA TGTCGCCTCT CACGGCGTCG AGGCGATCCT GTCGCGCCAC TGGGAGCTGG GCTCCGAGGG CTCCGCGCCG TTGGCTGAGA AGGTGGTCGA GCTTGTTGAG GGTGGCGGTG CAAACTTCGG TCCGCTCTAT CCCGATGAGA TGCCGCTGTT TGAGAAGATC GAAACCATCG CCAAGCGCAT CTATCGCGCT GACGAGGTTC TGGCCGATGC CAAGATCCGC AACCAGCTGA AAGAGTGGGA AGAAGCGGGC TATGGCCATC TGCCGGTCTG CATGGCGAAG ACCCAGTATT CGTTCTCGAC CGATCCGAAC CTGCGCGGTG CGCCCACCGG CCACTCGGTT CCGGTTCGCG AAGTGCGCCT CTCGGCGGGT GCGGGCTTTA TCGTGGTGGT TTGCGGTGAG ATCATGACCA TGCCGGGTCT GCCACGCACC CCGGCGGCGG AAAGCATCTG CCTCAATGAA GAGGGCCTGA TCGAAGGCTT GTTCTAA
|
Protein sequence | MSYKSDIEIA RAANKLPIQE IGAKLGMKND DLLPYGHDKA KVSQEFINSV QGNEDGKLVL VTAINPTPAG EGKTTTTVGL GDGLNRIGKN AMICIREASL GPNFGMKGGA AGGGMAQVVP MEEMNLHFTG DFHAITSAHS LLSAMIDNHI YWGNEQEIDI RRVAWRRVVD MNDRALRQIT ASLGGVSNGF PRETGFDITV ASEVMAILCL ANDLKDLEKR LGDIIVAYRR DKTPVYCRDI KAEGAMTVLL KDAMQPNLVQ TLENNPAFVH GGPFANIAHG CNSVIATKTA LKVADYVVTE AGFGADLGAE KFMNIKCRKA GIAPSAVVVV ATVRAMKMNG GVAKADLGAE NVEAVKNGCA NLGRHIENVK SFGVPAVVAI NHFVTDTDAE INAVKEYVAS HGVEAILSRH WELGSEGSAP LAEKVVELVE GGGANFGPLY PDEMPLFEKI ETIAKRIYRA DEVLADAKIR NQLKEWEEAG YGHLPVCMAK TQYSFSTDPN LRGAPTGHSV PVREVRLSAG AGFIVVVCGE IMTMPGLPRT PAAESICLNE EGLIEGLF
|
| |