Gene Rleg_3083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3083 
Symbol 
ID8013993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3080641 
End bp3082320 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content62% 
IMG OID644825651 
Productformate--tetrahydrofolate ligase 
Protein accessionYP_002976879 
Protein GI241205783 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATCCA TCAAATCCGA TATCGAAATC GCACGCGCCG CGGCCAAAAA GCCGATCTTC 
GAAATCGGGG CGAAACTCGG CATTCCGGTC GAGCAGCTCG TTCCCTATGG TCATGACAAA
GCGAAGGTCA GCGCCGAGTT CATTGCTGCG CAAGCAGGCA AGAAGGATGG CAAGCTGATC
CTCGTCACCG CGATCAACCC GACGCCGGCG GGCGAGGGCA AGACGACCAC CACCGTCGGG
CTCGGCGACG GGCTGAACCG GATCGGCAAG AAAGCCGTTG TCTGCATCCG CGAGGCCTCG
CTCGGCCCCT GCTTCGGCGT CAAGGGCGGG GCGGCCGGCG GCGGTTATGC ACAGGTCGTG
CCGATGGAAG ACATCAACCT GCATTTCACC GGCGATTTCC ATGCGATCAC ATCGGCGCAC
AATCTGCTGG CGGCGATGAT CGACAACCAC ATCTACTGGG GCAACGAAGA GAATATCGAC
ATCCGCCGCA TCACTTGGCG GCGGGTCATG GACATGAACG ACCGGGCGCT CAGAAGCATG
GTCTCATCGC TCGGCGGCGT CGCCAACGGT TTCCCGCGCC AGGGCGGGTT CGACATCACC
GTCGCCTCCG AGGTGATGGC GATCCTCTGC CTTGCCACCG ACCTCAAGGA TCTGGAGCGG
CGGCTTGGCG ACATCATCAT CGGCTATCGT TTCGACAGGA CGGCGGTGCA AGCGCGAGAC
CTGAAAGCCG ACGGCGCAAT GGCGGTGCTG TTGAAAGATG CGATGCAGCC GAACCTCGTG
CAGACGCTAG AGAACAATCC GGCCTTCGTG CATGGCGGCC CCTTCGCCAA CATCGCCCAT
GGCTGCAACT CGGTGACGGC GACGAAGACG GCGCTGAAGC TTGGCGACTA TGTGGTGACC
GAAGCGGGCT TCGGCGCCGA TCTCGGGGCC GAGAAATTCT TCGACATCAA GTGCCGCAAG
GCCGGGCTGA AGCCGGATGC GGCGGTGATC GTCGCGACCG TCCGGGCGCT GAAGATGAAT
GGCGGGGTGA AGAAGGAAGA TCTCGGCACG GAGGATGTCG AGGCGCTGAA GAAGGGCTGC
GCCAATCTTG GCCGGCACGT TGCCAATGTG CGTCGTTTCC GGGTGCCCGT CGTCGTGGCG
ATCAACCATT TCGTCTCGGA TACCGACGCC GAGATCGCGG CAGTCAAGGA ATTTGTCTCG
AGGCTTGGCG CCGAGGCGAT CCTCTGCCGC CACTGGGCGC TGGGTTCGGC CGGCATCGAG
GAACTGGCGC ACAAGGTGGT GGAACTGGCC GAATCCGGGC AGGCGAAATT CCAGCCGCTC
TATGGCGACG ACATTTCGCT GTTCGAGAAG ATCGAGATCG TCGCCTCGAA GATCTACCAT
GCCGGTGAAG TGACGGCCGA CAAGGCGGTG CGCGACCAGT TGCAGACATG GGAGGAGCAA
GGTTACGGCA AGCTGCCGAT CTGCATGGCG AAGACGCAAT ATTCCTTCTC CACCGATCCG
AACCTGCGCG GCGCGCCGGA AGGCCACATC GTCACCGTGC GAGAGGTGCG GCTTTCGGCG
GGAGCGGGCT TCGTCGTCGC CATCACCGGC GAGATCATGA CGATGCCGGG CCTGCCGAAA
TCACCGTCGG CGGAACGGAT TTTCCTGAAC GACCAGGGCT ATATCGAGGG GTTGTTTTAG
 
Protein sequence
MPSIKSDIEI ARAAAKKPIF EIGAKLGIPV EQLVPYGHDK AKVSAEFIAA QAGKKDGKLI 
LVTAINPTPA GEGKTTTTVG LGDGLNRIGK KAVVCIREAS LGPCFGVKGG AAGGGYAQVV
PMEDINLHFT GDFHAITSAH NLLAAMIDNH IYWGNEENID IRRITWRRVM DMNDRALRSM
VSSLGGVANG FPRQGGFDIT VASEVMAILC LATDLKDLER RLGDIIIGYR FDRTAVQARD
LKADGAMAVL LKDAMQPNLV QTLENNPAFV HGGPFANIAH GCNSVTATKT ALKLGDYVVT
EAGFGADLGA EKFFDIKCRK AGLKPDAAVI VATVRALKMN GGVKKEDLGT EDVEALKKGC
ANLGRHVANV RRFRVPVVVA INHFVSDTDA EIAAVKEFVS RLGAEAILCR HWALGSAGIE
ELAHKVVELA ESGQAKFQPL YGDDISLFEK IEIVASKIYH AGEVTADKAV RDQLQTWEEQ
GYGKLPICMA KTQYSFSTDP NLRGAPEGHI VTVREVRLSA GAGFVVAITG EIMTMPGLPK
SPSAERIFLN DQGYIEGLF