Gene Rcas_1900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1900 
Symbol 
ID5539378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2443794 
End bp2444900 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content65% 
IMG OID640894037 
Productmandelate racemase/muconate lactonizing protein 
Protein accessionYP_001432008 
Protein GI156741879 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0133232 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAC CCACAACCAT TCGCTCTCTG ACCGTGACCC CCCTTGACAT CCCGCTCCAC 
GAACCGTTCG GCATTGCCGG CGGCGCCCAG GAAATGGCGC ACAATCTGCT GATCACCGTC
GAGTTGACGG ACGGCACGCG CGGCTACGGC GAAGCAGCGC CGTTTCCGGC GTTCAACGGC
GAGACGCAGG AAACGGCGCG CGCGGCGATC CTGGCGGCGC AACCGATTGT CGAAGGCGCC
GACGCGCGCG AGTGGCGGCG GATTGCGCTG GCGCTCCCCG CTATTCCTGG CATGACCGGT
TCGGCGCGTT GCGCCATTGA GACTGCCCTG CTCGATGCGT TGACGCGCCG CGCGCGCCTG
CCGCTCTGGG CATTCTTCGG CGGCATGGCG ACGACCCTGG AAACGGATGT GACCGTCACC
ATCGGCAGCG TGGCGGCGGC GGCGCGCGCG GCGCAGGCGA TTGTCGCGCG CGGCGTCACC
ACGATCAAGA TCAAGATCGG CGCCGGCGAT CCAGAGACCA CCACGATCCG CACGATGGAA
CACGACCTGG CGCGGATTGT CGCTATCCGG GATGTCGCGC CGGATGCGCG CCTGATCCTC
GACGGCAACT GCGGCTATAC CGCCGATGAT GCGCTTCGAT TGCTCGATAT GCTGAGCGTT
CACGGCATTG TTCCGGTCCT GTTCGAGCAA CCGGTGGCGA AGGATGATGA AGCCGGATTG
CGTCGTCTGA CCGCCGCGCG CCGTGTGCCC ATCGCCGCCG ATGAAAGCGC ATCGAGCGCC
GCCGATGTTG CGCGGTTGGC GCAGTCCGGC GCCGTTGATG TGATCAACAT CAAATTGATG
AAATGTGGCA TTGTTGAAGC GCTCGACATC GCCGCGATTG CACGCGTTGC AGGCTTGCGC
CTGATGATCG GCGGAATGGT CGAGTCGTTT CTGGCAATGA CCGTCTCTGC CTGTTTTGCC
GCCGGTCAGG GCGGCTTCGC TTTCGTAGAC CTGGACACGC CCCTTTTTCT GGCAGAGAAC
CCCTTCGTCG GCGGTATGAC CTATCGTGGC GGCGTCATCG ATCTGGGGGC GATCCATGCC
GGGCACGGCG TCACGCTGCC TGCTTGA
 
Protein sequence
MTTPTTIRSL TVTPLDIPLH EPFGIAGGAQ EMAHNLLITV ELTDGTRGYG EAAPFPAFNG 
ETQETARAAI LAAQPIVEGA DAREWRRIAL ALPAIPGMTG SARCAIETAL LDALTRRARL
PLWAFFGGMA TTLETDVTVT IGSVAAAARA AQAIVARGVT TIKIKIGAGD PETTTIRTME
HDLARIVAIR DVAPDARLIL DGNCGYTADD ALRLLDMLSV HGIVPVLFEQ PVAKDDEAGL
RRLTAARRVP IAADESASSA ADVARLAQSG AVDVINIKLM KCGIVEALDI AAIARVAGLR
LMIGGMVESF LAMTVSACFA AGQGGFAFVD LDTPLFLAEN PFVGGMTYRG GVIDLGAIHA
GHGVTLPA