Gene RoseRS_4165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4165 
Symbol 
ID5211149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5215329 
End bp5216900 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content60% 
IMG OID640597754 
Productmalate synthase 
Protein accessionYP_001278459 
Protein GI148658254 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01344] malate synthase A 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.975172 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACACAC CGTATCGTGT CGAACTTCTT GGTCCGACCA GGCCGGAATG GTCAGAGATC 
CTCACGGCTG AAGCGCTCGA TTTTGTTGCC TCACTGGCGC GTCAGTTCGA GCATCGCCGG
CGCGCGCTGC TGGCTGCACG TGATCAGCGA TGGGCGGACA TCAAATCCGG CGCACTGCCC
GATTTCCTGC CTGAGACGGT TGACATTCGC GGCGGCGATT GGAGGGTGGC TTCCATTCCC
GCTGACTTTT CCAATCGGCG GGTTGAGATT ACCGGTCCTA CTGATCGGCG TATGGTGATC
AATGCGCTCA ACTCTGGCGC ACAGGTCTTT ATGGCGGATT TTGAGGATGC CAACGCCCCA
ACCTGGGAAA ACATCGTTCA GGGGCAACTC AACCTGCGTG ATGCCGTTCG CCGGACGATC
ACCTTCGTCA GCCCGGAGGG GCGTGAGTAT CGCCTGAATG ACACAACCGC CACCCTGGCG
GTGCGACCGC GCGGCTGGCA CCTTGTCGAG AAGCATGTCC ACGTTGATGA TGAGCCAGTG
GCTGGCGCTT TCTTCGATTT CGGGTTGTAC TTTTTCCACA ATGCGCACGA GTTGATCCGA
CGCGGTAGCG GTCCGTATTT CTACCTGCCG AAGATGCAGA GCCACCTGGA AGCGCGGCTC
TGGAACGACG TGTTCAACTT TGCGCAGGAT CGGCTCGGCA TCCCGCACGG TACGATCCGC
GCCACCGTAC TGATCGAGCA CATTCTGGCG GCGTTCGAGA TGGAAGAGAT TCTGTACGAG
TTGCGCGAAC ACAGCAGCGG TTTGAACCTG GGTCGCTGGG ATTATATCTA CAGTTTCATC
AAGACGTTCA GCCACCGCGA CGACTGGATC TTCCCCGATC GCGCACAGGT GACGATGACG
ACCCACTTCC TGCGTTCAGC GGCGGAACTC GTGGTCTATG CGTGCCACAA GCACGGCGCC
CACGCGCTCG GCGGCATGTC GGCGTTCATT CCGAACCGCC GCGAACCGGA GATTACCGAA
CGCGCCCTGG CGCAGGTGCG CGCCGATAAA GAGCGCGAGG CGAAGCAAGG GTTCGATGGC
GCCTGGGTGG CGCATCCCGA CCTGGTGCCG ACGGTGCTCG AAGTCTTCAA CACGGCGTTT
CCGGGTGATC ATCAGATCCA CTATGTGCCC GAGGTGCACG TCACCGCTGC CGATCTGCTG
ACCATCCCGC AGGGAACCAT CACCGAAGCC GGGTTGCGCA ACAATATCAA TGTGGCGCTG
CAATACCTCG AGGCGTGGCT TGGCGGTCGC GGCGCGGTTG CGATTTTCAA TCTGATGGAA
GATGTGGCGA CTGCTGAGAT TGCGCGTTCG CAGATCTGGC AGTGGGTGCG CTACAACGCG
AAACTGAACG ATGGTCGCAC GGTCGATGAG ACCATGTACA AGACGATCCG TGATGAAGAA
TTGCACGCAC TCGTCACTGC CCGCACCGGC GATCATCACT TCGCGCAGGC TGCCGAACTC
CTCGATGAAC TGACACTGTC GCATGATTTT GTCGAGTTCC TGACCATCCC CGGCTACCGT
CGTCTGGATT GA
 
Protein sequence
MDTPYRVELL GPTRPEWSEI LTAEALDFVA SLARQFEHRR RALLAARDQR WADIKSGALP 
DFLPETVDIR GGDWRVASIP ADFSNRRVEI TGPTDRRMVI NALNSGAQVF MADFEDANAP
TWENIVQGQL NLRDAVRRTI TFVSPEGREY RLNDTTATLA VRPRGWHLVE KHVHVDDEPV
AGAFFDFGLY FFHNAHELIR RGSGPYFYLP KMQSHLEARL WNDVFNFAQD RLGIPHGTIR
ATVLIEHILA AFEMEEILYE LREHSSGLNL GRWDYIYSFI KTFSHRDDWI FPDRAQVTMT
THFLRSAAEL VVYACHKHGA HALGGMSAFI PNRREPEITE RALAQVRADK EREAKQGFDG
AWVAHPDLVP TVLEVFNTAF PGDHQIHYVP EVHVTAADLL TIPQGTITEA GLRNNINVAL
QYLEAWLGGR GAVAIFNLME DVATAEIARS QIWQWVRYNA KLNDGRTVDE TMYKTIRDEE
LHALVTARTG DHHFAQAAEL LDELTLSHDF VEFLTIPGYR RLD