Gene Rcas_3848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3848 
Symbol 
ID5541352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5029785 
End bp5030951 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content61% 
IMG OID640895958 
Productmandelate racemase/muconate lactonizing protein 
Protein accessionYP_001433903 
Protein GI156743774 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.161688 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCA CCCGCGTTTC GACCCTCGTT GTGCACGCAC GGATGCGCAA CTGGGTTTTC 
GTCAAAGTCG AAACCGATCA GGATGGACTC TATGGCTGGG GCGAGGCGAC ACTCGAATGG
AAGACCAAAG GCGTGGTCGG CGCCGTTGAG GATGTCAGTC GCCTGATCAT CGGTGAAGAC
CCCCGGCGCA TCGAACACCT GTATCAGATG ATGACCCGTC AGTACTTCTG GCGCGCCGGC
ATCGAGGGGA TGACCGCCAT GAGCGGCATC GAGCAGGCGC TCTGGGATAT CAAAGGCAAG
TGGTTGAACG TGCCGGTCTA CGAGTTACTC GGCGGGCGCG TTCGAGACCG GATACGGGTC
TATAACCATC TTGGCGGCGG TACGATGGAG CACATGTACG AAACGACCGA CCCCGACTTG
TTTGCCGAGC GCGCGTTGAT GGTCAAAGAG CAAGGGTATA CAGCGCTCAA GTTTATGGCA
GTGCCGCGGA CGGAACCTGT TGAGGGCATG CGTCCGGTGC GGTATGCCGA GCGTCTGGTG
CGCGCCGTGC GCGACGCGGT CGGCGATGAT GTCGATCTGA TGGTCGATCT CCACGCCCGG
TGCGCTTCTC CGGCGATGGC GCTGCGCTAC TGCCGCGCGT TCGAGCCGTA TGGGTTGCTC
TTCTTCGAAG AGCCTTGCCC CAGCGAGGAT ATCGAAGCCA CCGCTCAGGT CACCCGCGCC
TCGAACATTC CGATCGCTAC CGGCGAACGC CTGGTCGGGC GACATCAATT CCGCGAAGTG
TTCGAGCGGC GCGCCTGCCA TATCATTCAG CCCGATCTGT CGCACTGCGG CGGTCTGTGG
GAAGCGCGCA AGATCGCGGC AATCGCCGAA ACGTACTCGA TGGCCGTTGC GCCGCATAAC
CCGAATGGAC CGATTGCGAC GGCGGCTGCC ATTCATTTCG CCGCCGCCAC GCCAAACTGG
GTGATTCAGG AAGCGATCAG CAACGATGTC CCCTGGCGAT ACGATGTGGT CGACTCAGAG
CATCAGGTTC GGGATGGGTA TATTGCCATA CCCTCACGCC CCGGTTTGGG CGTCGAGGTG
AACGAGGGCG AGGCGGCGCG CCACCCCTGG CAGCCGGAAC TGGTGCAACG CTATTTCCAT
CCCGACGGTT CCGTGGCTGA TTGGTAG
 
Protein sequence
MKITRVSTLV VHARMRNWVF VKVETDQDGL YGWGEATLEW KTKGVVGAVE DVSRLIIGED 
PRRIEHLYQM MTRQYFWRAG IEGMTAMSGI EQALWDIKGK WLNVPVYELL GGRVRDRIRV
YNHLGGGTME HMYETTDPDL FAERALMVKE QGYTALKFMA VPRTEPVEGM RPVRYAERLV
RAVRDAVGDD VDLMVDLHAR CASPAMALRY CRAFEPYGLL FFEEPCPSED IEATAQVTRA
SNIPIATGER LVGRHQFREV FERRACHIIQ PDLSHCGGLW EARKIAAIAE TYSMAVAPHN
PNGPIATAAA IHFAAATPNW VIQEAISNDV PWRYDVVDSE HQVRDGYIAI PSRPGLGVEV
NEGEAARHPW QPELVQRYFH PDGSVADW