Gene Rsph17029_2316 details


Gene Information

Locus tag: Rsph17029_2316
Symbol:
ID: 4895996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2449364
End bp: 2451037
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 68%
IMG OID: 640112912
Product: formate--tetrahydrofolate ligase
Protein accession: YP_001044190
Protein GI: 126463076
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
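
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2449364, 2451037)
print(length)                  # 1674
print(protein_length(length))  # 557
```

Both values agree with the Gene Length and Protein Length fields of the record.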


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.345965
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGTCC AGACCGATAT CGAGATCGCG CGCGCCGCGC GAAAGAAGCC CATCCAGGAG 
ATCGGTGCCG GGCTCGGCAT CCCGGCCGAG GCGCTGATCC CCTACGGTCA CGACAAGGCC
AAGGTCGGAC AGGGCTTCAT CCGCGGGCTC GAGGGTCGAC CGGACGGCAA GCTTATCCTC
GTGACCGCGA TCAACCCCAC GCCCGCGGGC GAGGGAAAGA CCACGACGAC GGTCGGTCTC
GGCGACGGTC TGAACCGGAT CGGCAAGAAG GCGGTCATCT GCATCCGCGA GGCCTCGCTC
GGGCCGAACT TCGGCATGAA GGGCGGGGCG GCGGGCGGCG GGCGGGCGCA GGTCGTGCCG
ATGGAGGACA TGAACCTCCA TTTCACCGGC GATTTCCACG CAATCACGGC GGCGCACAAC
CTGCTGGCGG CCATGATCGA CAACCACATC TACTGGGGCA ACGCGCTGGA ACTCGACGCC
CGGCGCATCA CCTGGCGGCG GGTGATGGAC ATGAACGACC GCGCGCTGCG CGACACGGTG
GTGAACCTCG GCGGCGTGGC GAACGGATTT CCGCGCCAGA CGGGCTTCGA CATCACCGTG
GCCTCCGAGG TGATGGCGAT CCTCTGCCTC GCGGACGATC TGGAGGATCT CGAACGCCGG
CTGGGCCGGA TCGTCGTAGG CTACCGCCGC GACAAGAGCC CGGTCTATTG CCGCGACCTG
AAGGCCGCGG GGGCGATGGC CGTGCTGCTC AAGGATGCGA TGCAGCCGAA CCTCGTGCAG
ACGATCGAGA ACAACCCGGC CTTCGTCCAT GGCGGACCCT TCGCCAACAT CGCCCACGGC
TGCAACTCGG TGATCGCCAC GCGCACGGCG CTGAAGCTGG CCGACTATGT CGTGACCGAG
GCGGGCTTCG GCGCGGATCT CGGGGCCGAG AAGTTCTTCG ACATCAAGTG CCGGCTGGCG
GGGCTGAAGC CCTCGGCCGC CGTGGTCGTG GCCACGGTCC GGGCGCTCAA GATGAACGGG
GGCGTGGCGC GCGAGGATCT CGGGCGCGAG GATGTCGCGG CGCTCCGGCG CGGCTGCGCG
AACCTCGGGC GGCACATCGC CAATGTGAAG GGCTTCGGCG TGCCGGTCGT GGTGGCGATC
AACCACTTCA CCACCGACAC CGAGGCCGAG ATCGAGGCCG TGCGCGCCTA TGCGGCGGGG
CAGGGGGCAG AGGCGTTCCT CTGCCGCCAC TGGGCGGAAG GCTCGGCCGG GATCGAGGAT
CTGGCGCAGA AGGTGGTCGA GCTGGCCGAG ACGCCCTCGA TGTTCGCGCC GCTCTACCCC
GACGACATGC CGCTTTTCGA GAAGATGGAG ACCGTGGCAC GTCGCATCTA TCACGCACAT
GACGTGATTG CCGACCATGT GATCCGCGAC CAGCTGCGAA CATGGGAGGA AGCGGGATAC
GGGGCGCTGC CGGTATGCAT GGCCAAGACG CAATACAGCT TCACGACCGA TGCGGCGATC
CGGGGCGCGC CCGAGGGGCA CTCCATTCCC ATCCGCGAGG TGAGGCTGGC GGCCGGCGCG
GGATTCGTCG TCGCGATCTG CGGCGAGATC CGCACCATGC CGGGCCTGCC GAGCCAGCCC
GCGGCCGAAC TTATCCATCT GGACGAAGAG GGACGGATCG AAGGCCTCTT CTGA
 
Protein sequence
MAVQTDIEIA RAARKKPIQE IGAGLGIPAE ALIPYGHDKA KVGQGFIRGL EGRPDGKLIL 
VTAINPTPAG EGKTTTTVGL GDGLNRIGKK AVICIREASL GPNFGMKGGA AGGGRAQVVP
MEDMNLHFTG DFHAITAAHN LLAAMIDNHI YWGNALELDA RRITWRRVMD MNDRALRDTV
VNLGGVANGF PRQTGFDITV ASEVMAILCL ADDLEDLERR LGRIVVGYRR DKSPVYCRDL
KAAGAMAVLL KDAMQPNLVQ TIENNPAFVH GGPFANIAHG CNSVIATRTA LKLADYVVTE
AGFGADLGAE KFFDIKCRLA GLKPSAAVVV ATVRALKMNG GVAREDLGRE DVAALRRGCA
NLGRHIANVK GFGVPVVVAI NHFTTDTEAE IEAVRAYAAG QGAEAFLCRH WAEGSAGIED
LAQKVVELAE TPSMFAPLYP DDMPLFEKME TVARRIYHAH DVIADHVIRD QLRTWEEAGY
GALPVCMAKT QYSFTTDAAI RGAPEGHSIP IREVRLAAGA GFVVAICGEI RTMPGLPSQP
AAELIHLDEE GRIEGLF
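
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this under stated assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as initiators, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers written for illustration, not functions from any existing library.

```python
# Codon table built from the conventional NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, spaced as in the record.
print(translate("ATGGCTGTCC AGACCGATAT CGAGATCGCG"))  # MAVQTDIEIA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence in the same way should reproduce the complete protein, provided the record is internally consistent.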