Gene Rcas_2117 details


Gene Information

Locus tag: Rcas_2117
Symbol:
ID: 5539597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2720149
End bp: 2721390
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 58%
IMG OID: 640894251
Product: mandelate racemase/muconate lactonizing protein
Protein accession: YP_001432220
Protein GI: 156742091
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
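
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes two things the record does not state outright: that the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2720149
END_BP = 2721390


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under those assumptions the results agree with the reported gene length (1242 bp) and protein length (413 aa).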


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.457647
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAGATCAA CGGTCATTAC CGCAGTCCAT GCCCGTGATG TTCGCTTTCC CTTAAAGCCG 
GGCGAGGGAG TGGACGCCAT CCATACCAAC CCGCAGTACG CCTACGCAGT CACGCTGCTC
GCCACAAATA CATCCCTGCG CGGCACGGGG CTGGCCTTCA CGCTTGGCGC CGGCACCGAA
CTGGTCTGCG ATGCTATCCG CATGCTGGCG CAGCCGCTGG AGGGACGCGA GATCGAAGAG
TTGATGGCCG ATTTCGGTCG TCTCACTCGT CAGATCGCCG ATCATCCACA GATGCGCTGG
CTTGGTCCCC ATAAAGGCGT TGTGCATCTG GCGCTTGCCT CGCTGACGAA TGCCTGTTTT
GATTTATGGG CTAAAGCGCG CGGCGTGCCG CTCTGGAAAC TGTTGCTCGA TTTGACGCCA
GAAGCGATCA TGGCGCTTCT TGACCTGAGT TACCTGGAAG ATGTCCTCAC CCCGTCTGAG
GCGATCAATA TGCTGTATCG TGAAATGACT CACCGCAACG AACGCGCAGC AATTCTGACG
CAGGGATATC CCGGCTATGA CACCTCGGTC GGCTGGTTTC ATTACGATGA TCGGCAACTG
ATCGAAAATG CGCGGCGTGC TGCGGATGCC GGTTTTTCGG CTATGAAACT GAAGGTCGGC
TCACCCGACC CAGCCCATGA TATTCGTCGG GCGCTACTGG TCCGCGAGAC GGTAGGACGC
GACGTGCGCA TCATGCTGGA CGCCAACCAG CAATGGACGT TGCCGATGGC GCTGCACGCC
TGTCAGGAAC TTGCATCGAT GCAACCATAC TGGATCGAGG AGCCGACCCA TCCCGATGAT
GTGATCGGAC ACCAAACGCT TGCGCGATCA ATTGCGCCGC TTCGGCTGGC AGTCGGCGAA
CACCTTCCCA ATCGAGTGGT CTTCAAAAAC TATATGCAGG CCAATGCTGC TCATTTCATT
CAAGCAGACT GCACGCGCGT CGGCGGGGTT AGCGAGTTCA TCACGGTGAG TCTGCTCGCC
AGGCGCTTCA ACCTGCCGGT AGCGCCACAC GTCGGGGATA TGGGACAAAT TCATCAGCAC
CTGACACTCT TCAACCGGAT TGCGCTGGGA CACGAGACCG TCTTTCTTGA GTATATCCCG
CACCTGCGCG ATCGCTTCCG CTACCCGGCA CAGGTTGAAG ATGGCGTCTA TCGCACACCA
CAGGAGCCAG GCAGCAGCGC CGATTTAATC GATTGCACCT GA
 
Protein sequence
MRSTVITAVH ARDVRFPLKP GEGVDAIHTN PQYAYAVTLL ATNTSLRGTG LAFTLGAGTE 
LVCDAIRMLA QPLEGREIEE LMADFGRLTR QIADHPQMRW LGPHKGVVHL ALASLTNACF
DLWAKARGVP LWKLLLDLTP EAIMALLDLS YLEDVLTPSE AINMLYREMT HRNERAAILT
QGYPGYDTSV GWFHYDDRQL IENARRAADA GFSAMKLKVG SPDPAHDIRR ALLVRETVGR
DVRIMLDANQ QWTLPMALHA CQELASMQPY WIEEPTHPDD VIGHQTLARS IAPLRLAVGE
HLPNRVVFKN YMQANAAHFI QADCTRVGGV SEFITVSLLA RRFNLPVAPH VGDMGQIHQH
LTLFNRIALG HETVFLEYIP HLRDRFRYPA QVEDGVYRTP QEPGSSADLI DCT
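
The gene sequence begins with GTG, while the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of the alternative initiation codons and is read as methionine when it starts a CDS. The sketch below is a minimal illustration rather than the database's own pipeline: `translate_cds` and the constant names are hypothetical, and only the first 30 bases and the last 12 bases of the gene sequence above are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Amino acid assignments in table 11 match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiation codon in first position as M.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("GTGAGATCAA CGGTCATTAC CGCAGTCCAT"))
    # Last 12 bases; not at a CDS start, so GAT is read as an ordinary codon.
    print(translate_cds("GATTGCACCT GA"))
```

The first call yields MRSTVITAVH, matching the first ten residues of the protein sequence, and the second yields DCT, matching its last three residues before the TGA stop codon.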