Gene RoseRS_2573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2573 
Symbol 
ID5209542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3189277 
End bp3190293 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content64% 
IMG OID640596177 
Product4-hydroxy-2-ketovalerate aldolase 
Protein accessionYP_001276899 
Protein GI148656694 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCGC CGCGCCTGAC CGATACGACC CTCCGTGATG GATCGCACGC ACTGGCGCAT 
ACCTTTACCC GTCAGCAGGT GCGTGATATT GTTCGCGCGC TGGATCGCGC CGGTGTGCCG
GTGATCGAAG TGACCCACGG CGATGGGTTG GCCGGTTCAT CGCTCCAGTA CGGTTTTTCG
CGCGTGCCGG ACCTCGACCT GATTGCCGAG GCGCGTGAGA CGGCGGAACG GGCGCGTATT
GCCGCGTTAC TCCTCCCCGG CATTGGCACG CGCCGCGAAC TGAGGGCCGC CGTCGAACGC
GGCGTTCAGG TACTGCGGAT CGCAACCCAG TGCACAGAAG CGGATATCAG CGAAGAGCAC
TTCAAAATGG CGAAAGACAT GGGGCTGGAG ACAGTCGGCT TTCTGATGAT GTCGCATATG
AGACCTCCTG AATTCCTTGC AGAACAGGCG CGTCTGATGG AGTCGTATGG CGCCGATTGC
GTCTACGTGG TGGACTCGGC GGGCGCCATG CTGCCGCATG ATGCGGCGGC GCGCGTCCAG
GCGCTCAAGG CGGCGCTCAC GGTGCAGGTC GGTTTCCATG CCCACAACAA TCTGGGGTTG
GGGATCGGCA ACACCCTGGC GGCGCTGGAA GCAGGCGCAG ACCAGATTGA TGGATGTCTG
CGGGGGTTGG GCGCCGGTGC GGGCAACGCC GCCACCGAGG TGCTGGCGGC GGTGCTCGAC
CGGCTGGGGA TCAACCCTGG TCTCGATGTG CTGGCGCTCA TGGACGCTGC CGAGTATGTG
GTGGCGCCGA TCATGCCATT TCAGCCCTTC CCCGACCGCG ATGCAATCAC GATCGGGTAT
GCCGGGGTCT ACTCGACCTT TCTGCTGCAT GCCAGGCGGA TCGGAGAACA GTTGGGCGTC
GATCCGCGCG CCATCCTGAT TGAACTGGGA CGGCGTCAGA CGGTTGCCGG GCAGGAAGAC
TGGATTCTTG ATGTGGCGCT CGAACTGGTG CGCCAGCAGC AGACCACGCC AGTATGA
 
Protein sequence
MNAPRLTDTT LRDGSHALAH TFTRQQVRDI VRALDRAGVP VIEVTHGDGL AGSSLQYGFS 
RVPDLDLIAE ARETAERARI AALLLPGIGT RRELRAAVER GVQVLRIATQ CTEADISEEH
FKMAKDMGLE TVGFLMMSHM RPPEFLAEQA RLMESYGADC VYVVDSAGAM LPHDAAARVQ
ALKAALTVQV GFHAHNNLGL GIGNTLAALE AGADQIDGCL RGLGAGAGNA ATEVLAAVLD
RLGINPGLDV LALMDAAEYV VAPIMPFQPF PDRDAITIGY AGVYSTFLLH ARRIGEQLGV
DPRAILIELG RRQTVAGQED WILDVALELV RQQQTTPV