Gene Smed_2298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2298 
Symbol 
ID5323159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2375519 
End bp2377198 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content61% 
IMG OID640791236 
Productformate--tetrahydrofolate ligase 
Protein accessionYP_001327965 
Protein GI229577668 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.150343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.307392 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGAGAGG TAAAGACCGA CATCGAGATC GCGCGCGCGG CGCGCAAGCA ACCAATCATG 
GAGGTCGGCG CGAGACTCGG TATCCCGCCG GAGCATCTGC TTCCCTACGG CCATGACAAG
GCCAAGGTGA GCGCCGAGTT CATCGCGGCG CAAAGGGAGA AGAAGAACGG GCGGCTCATC
CTGGTCACAG CGATCAACCC GACGCCGGCG GGCGAGGGCA AGACGACGAC GACCGTCGGT
CTCGTCGATG GCTTGAACCG TATCGGCAAG AAGGCGATCG TCTGTATTCG CGAGGCCTCG
CTCGGTCCTT GCTTCGGTAT CAAGGGCGGG GCGGCCGGTG GCGGTTATGC GCAGGTCGTG
CCGATGGAGG ACATCAATCT CCACTTCACC GGCGATTTCC ACGCGATCAC TTCGGCCCAC
AACCTTCTCT CCGCCTTGAT AGACAACCAC ATCTATTGGG GGAACGAGCA GGCGATTGAC
ATCCGCCGAA TTGCCTGGCG GCGGGTCATG GACATGAACG ACCGGGCGCT GCGCCATATC
GTCGGCTCGC TCGGCGGGGT CGCCAACGGC TATCCACGAG AGACCGGCTT CGACATTACC
GTCGCCTCGG AAGTCATGGC GATCCTCTGT CTCGCGATGG ACATCAGGGA CCTCGAAAGG
CGGCTCGGCA ACATCATCAT CGGCTATCGG CGCGACAAGA GCCCGGTCTA TGCGCGCGAT
ATCAAGGCCG ACGGAGCCAT GGCGGTGCTG CTCAAGGACG CGATGCAGCC GAACCTGGTG
CAGACGCTCG AGAACAATCC GGCATTCGTG CACGGCGGTC CGTTCGCCAA TATCGCGCAT
GGCTGCAATT CGGTCGTTGC CACCACGACG GCGCTGAAGC TTGCCGATTA CGTCGTGACC
GAAGCGGGTT TCGGTGCGGA TCTCGGTGCC GAGAAGTTCT TCGACATAAA GTGCCGCAAG
GCAGGCCTCA TGCCGGATGC TGCGGTGATC GTTGCGACGG TCCGGGCAAT CAAGATGAAC
GGCGGCGTGA AGAAGGAGGA TCTCGCGAAA GAAAATGTCG AAGCGCTCAG GAAGGGGTGC
CCGAACCTCG GGCGCCACAT CCAGAACGTC AAGAAGTTCG GCGTACCGGT GCTCGTCGCA
ATCAACCACT TCACTTCCGA TACCGAGGCC GAAATCCAGG CGATCAAGGA CTATGTCCGC
ACGCTCGGTT CCGAAGCGGT CTTATGCAAG CATTGGGCAG AGGGCTCAGC CGGCATCGAG
GAACTTGCCG ATAAGGTTGC CGATCTCGCC GACGCCGGCC ATTCGCAGTT TTCGCCGCTC
TATCCCGACG AGATGCCGCT TTTTCAGAAG ATCGAGGCGA TCGCCAAGGA TATTTATCAC
GCGAGCGAGG TGATCGCCGA CAAGCTGGTG CGCGACCAGC TTCGAATCTG GGAAGACCAG
GGTTACGGTC ATCTGCCGAT CTGCATGGCC AAGACGCAAT ATTCCTTCTC CACCGACCCG
AATCTCCGCG GCGCGCCCAC CGGCCACACC GTACCGATAC GCGAGGTTCG CCTGGCCGCC
GGCGCTGGGT TCATCGTCGT CATCACCGGC GAGATCATGA CGATGCCGGG CCTGCCGAAA
GTACCCTCTT CGGAGCGGAT ACGGCTCGAC GAGGAGGGAT ACATCGAGGG TTTGTTCTGA
 
Protein sequence
MGEVKTDIEI ARAARKQPIM EVGARLGIPP EHLLPYGHDK AKVSAEFIAA QREKKNGRLI 
LVTAINPTPA GEGKTTTTVG LVDGLNRIGK KAIVCIREAS LGPCFGIKGG AAGGGYAQVV
PMEDINLHFT GDFHAITSAH NLLSALIDNH IYWGNEQAID IRRIAWRRVM DMNDRALRHI
VGSLGGVANG YPRETGFDIT VASEVMAILC LAMDIRDLER RLGNIIIGYR RDKSPVYARD
IKADGAMAVL LKDAMQPNLV QTLENNPAFV HGGPFANIAH GCNSVVATTT ALKLADYVVT
EAGFGADLGA EKFFDIKCRK AGLMPDAAVI VATVRAIKMN GGVKKEDLAK ENVEALRKGC
PNLGRHIQNV KKFGVPVLVA INHFTSDTEA EIQAIKDYVR TLGSEAVLCK HWAEGSAGIE
ELADKVADLA DAGHSQFSPL YPDEMPLFQK IEAIAKDIYH ASEVIADKLV RDQLRIWEDQ
GYGHLPICMA KTQYSFSTDP NLRGAPTGHT VPIREVRLAA GAGFIVVITG EIMTMPGLPK
VPSSERIRLD EEGYIEGLF