Gene TM1040_2358 details


Gene Information

Locus tag: TM1040_2358
Symbol:
ID: 4076477
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2478640
End bp: 2480316
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 61%
IMG OID: 638007680
Product: formate--tetrahydrofolate ligase
Protein accession: YP_614352
Protein GI: 99082198
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
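
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 2478640
end_bp = 2480316

# Inclusive span: both endpoints count as bases of the gene.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last one is the stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1  # assumption: stop codon is not part of the protein

print(gene_length, remainder, protein_length)  # 1677 0 558
```

Under these assumptions the result agrees with the listed gene length (1677 bp) and protein length (558 aa).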


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.269795
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.879524
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTATA AGAGTGACAT TGAGATCGCC CGCGCGGCGA ACAAGCTCCC GATCCAGGAG 
ATCGGCGCCA AACTGGGCAT GAAAAACGAC GACCTGCTGC CCTATGGCCA CGACAAGGCG
AAGGTGAGCC AAGAGTTCAT CAACTCCGTG CAGGGCAATG AAGACGGCAA GCTGGTGCTG
GTGACCGCGA TCAACCCGAC CCCGGCGGGT GAGGGCAAGA CCACCACCAC CGTTGGTCTG
GGCGATGGTC TGAACCGTAT CGGCAAAAAT GCGATGATCT GTATCCGCGA AGCGTCCCTC
GGGCCGAACT TCGGCATGAA GGGTGGCGCC GCCGGTGGCG GCATGGCGCA GGTTGTGCCG
ATGGAGGAAA TGAACCTCCA CTTCACAGGC GACTTCCATG CGATCACCTC GGCGCACTCG
TTGCTGTCGG CCATGATCGA CAACCACATC TACTGGGGCA ACGAGCAGGA AATCGACATC
CGCCGCGTTG CATGGCGTCG TGTGGTCGAC ATGAACGACC GCGCCCTGCG CCAGATCACG
GCGTCGCTGG GTGGGGTGTC CAACGGCTTC CCGCGCGAAA CCGGTTTTGA CATCACCGTG
GCTTCCGAGG TCATGGCGAT CCTCTGCCTT GCCAACGACC TGAAGGACCT GGAAAAGCGT
CTCGGCGACA TCATCGTGGC CTATCGCCGC GACAAGACCC CGGTTTACTG CCGCGACATC
AAGGCCGAAG GCGCGATGAC CGTTCTCCTG AAGGACGCCA TGCAGCCCAA CCTCGTTCAG
ACCCTGGAAA ACAACCCGGC GTTTGTACAC GGTGGCCCGT TTGCGAACAT CGCACATGGC
TGTAACTCCG TGATCGCAAC CAAGACCGCG CTCAAAGTGG CGGATTACGT TGTCACAGAA
GCAGGCTTTG GGGCCGACCT CGGCGCCGAG AAGTTCATGA ACATCAAATG CCGCAAGGCC
GGCATCGCCC CCTCTGCGGT GGTGGTCGTT GCGACCGTGC GCGCCATGAA GATGAACGGT
GGCGTGGCCA AGGCGGATCT GGGCGCGGAA AACGTCGAGG CGGTCAAGAA CGGCTGTGCA
AACCTCGGTC GTCACATCGA GAACGTCAAA TCCTTTGGCG TGCCTGCGGT CGTGGCGATC
AACCACTTCG TCACCGACAC CGATGCGGAG ATCAATGCGG TCAAGGAATA TGTCGCCTCT
CACGGCGTCG AGGCGATCCT GTCGCGCCAC TGGGAGCTGG GCTCCGAGGG CTCCGCGCCG
TTGGCTGAGA AGGTGGTCGA GCTTGTTGAG GGTGGCGGTG CAAACTTCGG TCCGCTCTAT
CCCGATGAGA TGCCGCTGTT TGAGAAGATC GAAACCATCG CCAAGCGCAT CTATCGCGCT
GACGAGGTTC TGGCCGATGC CAAGATCCGC AACCAGCTGA AAGAGTGGGA AGAAGCGGGC
TATGGCCATC TGCCGGTCTG CATGGCGAAG ACCCAGTATT CGTTCTCGAC CGATCCGAAC
CTGCGCGGTG CGCCCACCGG CCACTCGGTT CCGGTTCGCG AAGTGCGCCT CTCGGCGGGT
GCGGGCTTTA TCGTGGTGGT TTGCGGTGAG ATCATGACCA TGCCGGGTCT GCCACGCACC
CCGGCGGCGG AAAGCATCTG CCTCAATGAA GAGGGCCTGA TCGAAGGCTT GTTCTAA
 
Protein sequence
MSYKSDIEIA RAANKLPIQE IGAKLGMKND DLLPYGHDKA KVSQEFINSV QGNEDGKLVL 
VTAINPTPAG EGKTTTTVGL GDGLNRIGKN AMICIREASL GPNFGMKGGA AGGGMAQVVP
MEEMNLHFTG DFHAITSAHS LLSAMIDNHI YWGNEQEIDI RRVAWRRVVD MNDRALRQIT
ASLGGVSNGF PRETGFDITV ASEVMAILCL ANDLKDLEKR LGDIIVAYRR DKTPVYCRDI
KAEGAMTVLL KDAMQPNLVQ TLENNPAFVH GGPFANIAHG CNSVIATKTA LKVADYVVTE
AGFGADLGAE KFMNIKCRKA GIAPSAVVVV ATVRAMKMNG GVAKADLGAE NVEAVKNGCA
NLGRHIENVK SFGVPAVVAI NHFVTDTDAE INAVKEYVAS HGVEAILSRH WELGSEGSAP
LAEKVVELVE GGGANFGPLY PDEMPLFEKI ETIAKRIYRA DEVLADAKIR NQLKEWEEAG
YGHLPVCMAK TQYSFSTDPN LRGAPTGHSV PVREVRLSAG AGFIVVVCGE IMTMPGLPRT
PAAESICLNE EGLIEGLF
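
The correspondence between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own method: the `translate` helper and the constant names are hypothetical, and the codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons and differs mainly in which start codons it permits (irrelevant here, since the gene starts with ATG).

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks shown in the
    record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence from the record.
first_bases = "ATGAGCTATA AGAGTGACAT TGAGATCGCC"
last_line = "CCGGCGGCGG AAAGCATCTG CCTCAATGAA GAGGGCCTGA TCGAAGGCTT GTTCTAA"

print(translate(first_bases))  # MSYKSDIEIA
print(translate(last_line))    # PAAESICLNEEGLIEGLF
```

Both fragments match the corresponding ends of the listed protein sequence; the final line is in frame because the 1620 bases preceding it are a multiple of three.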