Gene RoseRS_4285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4285 
Symbol 
ID5211269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5372011 
End bp5373141 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content65% 
IMG OID640597872 
ProductGHMP kinase C terminal domain-containing protein 
Protein accessionYP_001278576 
Protein GI148658371 
COG category[I] Lipid transport and metabolism 
COG ID[COG3407] Mevalonate pyrophosphate decarboxylase 
TIGRFAM ID[TIGR01240] diphosphomevalonate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0398897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0848141 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCCG CCGATGCCGA ACTCCGTGCA GCGCTGCGCG CTCATGGGCT GAGGTGGGAA 
GCCTACCCGG ATCTGACAGG GGCTGCGCGC GTGCGTGGCG TGGCAGCCGC GCTCGCCTAC
CCGATGCAGG GGGTCTTGAA ATACCACGGA CTGAGCGACT GGAAGTATCG CATCGCCTTC
CTGCCGAGCA TCTCGCTCTG CAACGACGCA GGGCATACCC TGACGCTGGT TGAGTTCGAT
CCCGATCTGC CGGACGACAG CGCAACCATC AATGGTCAAC CGGCGCGTGG GCGTGAACTC
GAACGGGTGC AGCAGAGTCT CGACGCGATC CGCGCCGTAT CCGGCGCGAC CGTGCACGCC
CGTGTGACGT CACGCAATGT CACCCGCGGA ACACGCTTCG GAAAAGGGCT TGGGTCGAGC
GCGTCGGCGT CAGCGGCGCT GGCGCTGGCA GCAATCGCCG CCCTGTACGG CGAGGAGGCG
GCATCCAACC GTCGCCTGGT CAGTTGCATG GCGCGACTGC TGGCGGGTTC CGGCTGTCGC
AGTGCAGCTG GCGGATGTTC CATCTGGCTG TCGTACCCTG GCATCGCGCA CGAAGAGAGT
TTTGCCGTGC GCCTCGATGA TGCCGGGCAA CTCGATGATG TGCGCCTGAT CACTGTACCG
ATCGATTCGC GCATCGGACT GAAGACCGAA CAGGCGCATA TGGACGCGCC TGCGAGCGCG
CTGTTCCGCT GCTGGATGCT CAACCGTCGT GACGAGGCGC TGGCGTGCAT CGCGGCGGCG
CGTGCAGGCG ACTGGCGCAC CCTTGGGCAA TGGGCGGAAC TGGACAGCAT GCGACTGCAC
GGCATCACCA TGTCGGGGAG TCTGGAGAAC AAACTGATTG GCTGGGAACC GGAGAATATT
GCACTGTTTC GCATGTGCAA CGATCTGCGC AGCGGTGGCG TCCCCGTCTA CTGCTCGACC
GACACCGGTC CGACAGCGGT GTTCATCACT CACCGCGACT ATGAGGAGGC GGTCGTCGCA
GCCATCGAGG CGACCGGTCT TGGACTGGAA ACGATCCGTG GACGGATCGC CGGTCCGGCG
CGCCTGGTCG ATGTCGCCTG GGCGCAGGGA GCGTTGGGGG TTGAAGGGTG A
 
Protein sequence
MASADAELRA ALRAHGLRWE AYPDLTGAAR VRGVAAALAY PMQGVLKYHG LSDWKYRIAF 
LPSISLCNDA GHTLTLVEFD PDLPDDSATI NGQPARGREL ERVQQSLDAI RAVSGATVHA
RVTSRNVTRG TRFGKGLGSS ASASAALALA AIAALYGEEA ASNRRLVSCM ARLLAGSGCR
SAAGGCSIWL SYPGIAHEES FAVRLDDAGQ LDDVRLITVP IDSRIGLKTE QAHMDAPASA
LFRCWMLNRR DEALACIAAA RAGDWRTLGQ WAELDSMRLH GITMSGSLEN KLIGWEPENI
ALFRMCNDLR SGGVPVYCST DTGPTAVFIT HRDYEEAVVA AIEATGLGLE TIRGRIAGPA
RLVDVAWAQG ALGVEG