Gene RoseRS_4541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4541 
Symbol 
ID5211526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5694764 
End bp5696125 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content63% 
IMG OID640598119 
ProductFolC bifunctional protein 
Protein accessionYP_001278822 
Protein GI148658617 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.195738 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.774158 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTACC AGCAAGCGCT CGATTATCTC TACTCATTCA TTGCCGGACA ACAGGGCGCG 
TCGCGCCCGC CGCCGATGAT GAACCTGGTG CGCACGCGTG CGCTCCTGGC GGCGCTCGGT
AATCCGCACC ACGCAATGCC GTCCGTGATC ATCGCTGGCA CCAAAGGCAA AGGTTCGACC
GCTGCACTGC TGGAAGCAAT TGTGCGCGCA GCCGGGTTGC GCACCGGACT GTGGACATCG
CCGCACCTGC ATACCTACCG TGAGCGGATT CAGGTCAACC GTACTCCGAT GACCCGTGAC
GAACTGGTCC GTGCGGTTGA GTCGATCCAG CCGGTTATCG AATCGATGAT GAGCGGACCT
GTCGGCGCGC CAGTGACATT TGCGATTGGA TTTGCCCTGG CGCTGCGCTA TTTCGCCGAA
CACGCCGTCG ATCTGGCGAT CCTCGAAGTC GGTGTGGGCG GGCGTTTCGA TAGCGCGGCA
GTGGTGACGC CGATTCTCAG CGTGATAACA CCGATCAGTT ACGACCATAT GGACCTGCTG
GGGGATACAT TGGCGCAGAT CGCGTGGGAG AAGGCGGGCA TTATGAAGCC GGGCGTTCCT
GTCATCAGTG CACCGCAACA TCCGGAAGCG CGTGAAACGC TGATCCGTTG CGCTGCGGAG
ATCGGCGCGC CGCTGTATTT CGTGCGTGAT ACGCTGGCGC CTTCGGCAAT CCAGGACGAT
CATTCCACCG GCGCCTGGCC CATACCTGCC GGACAGGGTC TGGACATTGA GCCGCAGCAG
CGCATCCTTT TTGACATCGC GCCGTCGCTC CCCGGTGTGT TCCAGAAGGT GAACGCGCGC
CTGGCGACCG GCGCAGCGCT GCTTCTCCGC GATGCAGGTC TGCCGATAGC CGACGATGCC
ATTGCCCGTG GTCTGGCTGA CGCGCGCTGG CCCGGACGGA TGGAGATTAT CGACGGTGCG
CCGCCAATTG TGCTCGACGG CGCACACAAT GGCGAGTCGA TGCGTCAACT GGTGCAATCG
CTGCGCTGTC TCTTCCCGAA CAGGCGCTAC GTCGTTGTGT TTGGCGCATC GCGCGACAAA
GACCTGGAGC GTATGCTTCC CGAACTGGCG CCGGCGGTTG ATGCGCTTGT GCTGACCGCG
TCGCGCCATC CGCGCGCGCT GGTTGCGCTC GACGAATTGC GGCAGCGGTT TATCCCGCTG
GTGCGCGAAG GTGTGACGAT CGACATCGTT CTGGATCCGG CAGAAGCGCT GGCGCATGCG
CGCGCATATG CTACTGACGA CGATCTGATC TGTGTCACCG GTTCACTGTT CATCGTCGCC
GCAGCGCGCG AGGCGCTGGG TCTGGCGCAA GAACGCGACT GA
 
Protein sequence
MDYQQALDYL YSFIAGQQGA SRPPPMMNLV RTRALLAALG NPHHAMPSVI IAGTKGKGST 
AALLEAIVRA AGLRTGLWTS PHLHTYRERI QVNRTPMTRD ELVRAVESIQ PVIESMMSGP
VGAPVTFAIG FALALRYFAE HAVDLAILEV GVGGRFDSAA VVTPILSVIT PISYDHMDLL
GDTLAQIAWE KAGIMKPGVP VISAPQHPEA RETLIRCAAE IGAPLYFVRD TLAPSAIQDD
HSTGAWPIPA GQGLDIEPQQ RILFDIAPSL PGVFQKVNAR LATGAALLLR DAGLPIADDA
IARGLADARW PGRMEIIDGA PPIVLDGAHN GESMRQLVQS LRCLFPNRRY VVVFGASRDK
DLERMLPELA PAVDALVLTA SRHPRALVAL DELRQRFIPL VREGVTIDIV LDPAEALAHA
RAYATDDDLI CVTGSLFIVA AAREALGLAQ ERD