Gene Rcas_4376 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4376 
Symbol 
ID5541889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5629822 
End bp5631393 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content59% 
IMG OID640896482 
Productmalate synthase 
Protein accessionYP_001434418 
Protein GI156744289 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01344] malate synthase A 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACACAC CGCAGCGTGT CGAAATTCTC GGTTCGACCA GGCCGGAGTG GTCGGAGATC 
CTCTCGGTTG AAGCGCTCGA TTTTATTGCT GCTCTGGCGC GTCAGTTCGA ACATCGCCGG
CGCGCGTTGC TTGCGGCGCG CGAGCAGCGT TGGGCGGATA TCAGAGGGGG CGCATTGCCC
GATTTTCTGT CGGAGACGGC TGAGATTCGC GGCAGTGATT GGAAGGTGGC GTCGATCCCT
GCCGATTTCT CGAACCGGCG TGTGGAGATT ACCGGTCCGA CCGATCGGCG TATGGTGATT
AACGCCCTGA ACTCCGGTGC GCAGGTGTTC ATGGCCGATT TTGAGGACGC GAATGCGCCC
ACCTGGGAGA ATATGGTGCA GGGGCAGCTC AACCTGCGCG ATGCCGTGCG TCGGACGATC
ACGTTCGTCA GTCCGGAGGG GCGCGAGTAT CGCCTGAACG ACACGATTGC AACGCTGGCG
GTGCGTCCGC GTGGCTGGCA TCTGATTGAG AAGCATGTCC ATGTGGACGG CGAGCCGGTG
GCCGGCGCGT TCTTCGATTT CGGTCTTTAT TTCTTCCACA ATGCGCGCGA ATTGATCCGA
CGCGGCAGCG GTCCGTACTT CTATCTGCCG AAGATGCAGA GCCATCTGGA AGCGCGCCTG
TGGAACGATG TGTTCAATTT TGCCCAGGAT CGGCTTGGTA TTCCGCGCGG CACGATCCGT
GCAACGGTGC TGATCGAGCA TATTCTTGCC GTCTTCGAAA TGGAAGAAAT CCTTTACGAA
TTGCGTGAGC ATAGCAGCGG CTTGAACCTG GGACGCTGGG ATTATATCTA CAGTTTTATT
AAGACGTTCA ACCATCGCAG CGACTGGATT TTTCCCGATC GGGCGCAGGT GACGATGACG
ACACATTTCC TGCGCTCAGC GGCGGAACTC CTGGTCTATT CCTGCCACAG GCGCGGCGCT
CACGCGCTCG GTGGTATGTC GGCGTTTATT CCGAACCGGC GTGAGCCGGA GATTACTGAA
CGCGCGCTGG CGCAGGTTCG CGCCGATAAG GAGCGTGAGG CAAAGCAGGG GTTCGATGGC
GCCTGGGTGG CGCATCCCGA CCTGGTGCCA ACTGTGCTTG AGGTCTTCAG TGCCGCGTAT
GAGGGCGATC ATCAGATCCA TTACGTGCCG CAGGTTCATG TCACTGCTGC CGATCTGCTG
ACCATTCCGC AGGGGACGAT TACCGAAGCG GGGTTGCGCA ACAATATCAC GGTTGCGTTG
CAGTACATCG AAGCGTGGTT GGGCGGTCGC GGCGCGGTGG CGATCTTCAA CTTGATGGAA
GATGTGGCGA CGGCGGAAAT TGCGCGCTCG CAACTCTGGC AGTGGGTGCG CTACAATGCC
CGCCTGGATG ATGGACGCAC CATTGATGAG ACAATGTATA AGACGATGCG CGACGAAGAA
TTGCACACGC TGGTCGCGGC GCGCACCGGT GATCATCATT TTGCGCTGGC GGCGGAATTG
CTCGATGAAC TGACGCTGTC GCGCGATTTT GTGGAGTTCC TGACCATCCC CGGCTATCGC
CGTCTGGATT GA
 
Protein sequence
MDTPQRVEIL GSTRPEWSEI LSVEALDFIA ALARQFEHRR RALLAAREQR WADIRGGALP 
DFLSETAEIR GSDWKVASIP ADFSNRRVEI TGPTDRRMVI NALNSGAQVF MADFEDANAP
TWENMVQGQL NLRDAVRRTI TFVSPEGREY RLNDTIATLA VRPRGWHLIE KHVHVDGEPV
AGAFFDFGLY FFHNARELIR RGSGPYFYLP KMQSHLEARL WNDVFNFAQD RLGIPRGTIR
ATVLIEHILA VFEMEEILYE LREHSSGLNL GRWDYIYSFI KTFNHRSDWI FPDRAQVTMT
THFLRSAAEL LVYSCHRRGA HALGGMSAFI PNRREPEITE RALAQVRADK EREAKQGFDG
AWVAHPDLVP TVLEVFSAAY EGDHQIHYVP QVHVTAADLL TIPQGTITEA GLRNNITVAL
QYIEAWLGGR GAVAIFNLME DVATAEIARS QLWQWVRYNA RLDDGRTIDE TMYKTMRDEE
LHTLVAARTG DHHFALAAEL LDELTLSRDF VEFLTIPGYR RLD