Gene RPD_0200 details


Gene Information

Locus tag: RPD_0200
Symbol:
ID: 4020658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 223692
End bp: 225038
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 69%
IMG OID: 637960379
Product: FolC bifunctional protein
Protein accession: YP_567341
Protein GI: 91974682
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
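
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 223692
end_bp = 225038

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1347 bp) and Protein Length (448 aa).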


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGCAG CGACTGCTCC GCAGCCGTCT GCTCTGGTCG GCGAGCTGCG CGCGCGACTT 
GCGCAGCTCC ATCCCGCACA GATCGATCTG ACGCTCGGCC GGATCGAACG GTTGCTGGCG
GCGCTCGATC ATCCGCAGCG CAGGCTGCCG CCGGTGATCC ACATCGCCGG CACCAACGGC
AAGGGCTCTA CCCTCGCCTT TCTCCGCGCC ATTCTCGAAG CCGCCGGCCT CAGCGTCCAC
GCCTACACCT CGCCGCATCT GGTCCGCGTC AACGAAACCG TCCGTCTCGG ACGGCCGGGC
GGCGGCGCGC TGGTGAGCGA CGATGAATTC GCCGCGGCGC TGGCGCATTG CGAGCGCGTC
AATCAAGGCG CGCCGATCAC GCTGTTCGAG ATCGAAACCG CCGCGGCGCT GTGGCTGTTC
GCGCAACATC CCGCCGACGT CACGCTGCTG GAAGTCGGCC TCGGTGGCCG GCTCGACGCC
ACCAACGTGA TCGACCAGCC GCTCGCCTGT GTGCTGACCC CGATCGGCAT CGACCACACC
GAGTTTCTCG GGCCGACGCT CGCGGACATC GCCGCCGAAA AGGCTGGCAT CATCCGCCGC
GGTGTTCCGG TGATCGTGGC CGGGCAGCAG AACGATGCGA TGGACGTGAT CGAGCGCGAA
GCCGAGCGGC TACGCGCGCC GCTGCACGCG CGCGGCCAGC AATGGCATGT CGAGGTCGAA
CACGGCCGGC TCGCCTATCA GGACGACCGC GGCCTCATGG ACCTCACCGC GCCAAAACTG
TTCGGCCGGC ACCAGATCGA CAATGCCTGG CTGGCGATCG CGACGCTGCG CGCGCAACAA
CGCTTCACCT TTGACCAGGC CGCCTATCAG GCAGGGTTGT TGTCGGCGGA CTGGCCGGCG
CGGATGCAGC GGCTGACGAC CGGCAGGCTG ATCGACGAAG CGCCACCCGG CAGCGAACTC
TGGCTCGACG GCGGCCACAA TGCCGACGGC GGCCGCGTCG CCGCAGCGGC GCTCGGCGAT
CTGGAAGAGC GGGTGTCGCG GCCGCTGGTG ATCATTGCCG GCATGATGGC CAACAAGGAC
GCCAGCGCGT TCCTGACCAA TTTCACCGGA CTGACCCGCC ACGTCATCGC GGTGCCGATC
CCCGATCGCG ACGGCGCGAT GCCGCCGGAA AAGCTCGCCG ACGCCGGGCG CGCGCTCGGC
CTGCGGGTCG AACTCGCCGA TAGCGTGGAG GCGGCGCTGA GCCGGATCGC CGGCCTTGCC
TATGAGCTGC CACCGCGCAT CCTGATCACC GGCTCGTTGT ATCTCGCCGG CCATGTACTG
CGCCTCAACG GCACAATGCC GAGCTGA
 
Protein sequence
MSAATAPQPS ALVGELRARL AQLHPAQIDL TLGRIERLLA ALDHPQRRLP PVIHIAGTNG 
KGSTLAFLRA ILEAAGLSVH AYTSPHLVRV NETVRLGRPG GGALVSDDEF AAALAHCERV
NQGAPITLFE IETAAALWLF AQHPADVTLL EVGLGGRLDA TNVIDQPLAC VLTPIGIDHT
EFLGPTLADI AAEKAGIIRR GVPVIVAGQQ NDAMDVIERE AERLRAPLHA RGQQWHVEVE
HGRLAYQDDR GLMDLTAPKL FGRHQIDNAW LAIATLRAQQ RFTFDQAAYQ AGLLSADWPA
RMQRLTTGRL IDEAPPGSEL WLDGGHNADG GRVAAAALGD LEERVSRPLV IIAGMMANKD
ASAFLTNFTG LTRHVIAVPI PDRDGAMPPE KLADAGRALG LRVELADSVE AALSRIAGLA
YELPPRILIT GSLYLAGHVL RLNGTMPS
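
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes NCBI translation table 11 as listed in the record, which uses the standard amino acid assignments but accepts alternative start codons such as GTG, with the initiator codon read as methionine. The helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G);
# translation table 11 shares these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a complete CDS, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # the initiator codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# The first 30 bases of the gene sequence above give the first 10 residues.
print(translate_cds("GTGAGCGCAG CGACTGCTCC GCAGCCGTCT"))  # MSAATAPQPS
```

Passing the full gene sequence to `translate_cds` and comparing the result against the protein sequence, or passing it to `gc_fraction` and comparing against the reported GC content, is a quick way to check that a copied record is intact.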