Gene Rcas_2553 details


Gene Information

Locus tag: Rcas_2553
Symbol:
ID: 5540035
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3293036
End bp: 3294340
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 59%
IMG OID: 640894682
Product: mandelate racemase/muconate lactonizing protein
Protein accession: YP_001432649
Protein GI: 156742520
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
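
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both assumptions are consistent with the values in this record but are not stated by the source. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3293036, 3294340)
print(length)                  # 1305
print(protein_length(length))  # 434
```

Both results agree with the Gene Length and Protein Length fields in the record.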


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00423764
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.461549
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATCA CCATCACCCG CGCCGAAGCG CGCGACGTTC GCTTCCCCAC TTCCCGCACT 
CTCGATGGAT CGGACGCCAT GAACCCCGAC CCCGACTACT CGGCGGCATA TGTGATTCTC
CACACAAACG TCCCTGGTCT GACGGGGCAC GGGCTGACCT TTACCATCGG GCGTGGCAAT
GAATTGTGCG TCGCCGCATG TCAGGTTTTG CTGCCGATGG TGGTGAACCG TTCCCTTGAG
TCAATCACTG CTGACATGGG CGCGTTCTGG CATATGATCA CCGGCGACAG CCAGTTGCGC
TGGATCGGAC CGGAGAAGGG CGTGATTCAT CTGGCGACTG CCGCCGTGGT CAATGCGGTT
TGGGACCTGT GGGCAAAGGT TGAGCAAAAA CCGCTCTGGA AGTTGTTGAG CGATATGTCG
CCGGAAGAAC TGGTGCGCTG CATCGATTTT CGCTACATTT CCGATGCGCT GACGCCTGAT
GAGGCGCGTG ACATTCTGCG CCGCCAGGAG GCGACGCGCG CCAAACGCGA GGCGGAAATG
CGCACGCACG GGTTTCCTGC CTATACGACA TCGGCGGGTT GGATCGGGTA TTCCGACGAC
AAAGTGCGCC GGTTGTGCCG GGAAGCGATC GATGCCGGGT TCCAGCACAT CAAAATGAAG
GTTGGACGTG ATCTCGATGC CGACCGGCGC CGCGCCCGGT TGATCCGCGA GATCATTGGA
CCGGATCGCA AATTAATGGC AGATGCCAAC CAGGTGTGGG ATGTGCCGCA GGCGATTGCC
TGGATGCGCG ACCTTGCAGA ATTCGACCTC TGGTGGATCG AGGAGCCAAC CAGCCCCGAC
GATATTCTGG GTCATGCGGC GATTGCCCGC GCTGTGGCGC CGGTTGGCGT GGCAACCGGC
GAGCATGTCC AGAACCGCAT TGTCTTCAAA CAACTGTTGC AGATGAATGC CATCAATTTC
TGTCAGATCG ATGCCTGCCG CCTCGGCGGG GTCAACGAGG TGTTGGCGGT TATCCTGATG
GCCGCAAAGT TTGGCGTACC GGTCTGCCCG CATGCTGGCG GTGTCGGGTT GTGCGAGTAT
GTCCAACATC TGTCGATCTG GGATTACATC TGCGTTTCCG CATCGCTGGA GAATCGTGTG
ATTGAATACG TCGATCATCT GCACGAGCAC TTTCTCGATC CGGTTGTCAT CCGCAATGCT
CGCTACATGC CGCCGCAGAC GCCAGGATAC AGCATCGAAA TGAAACCGGA GTCGCTGGCA
ATGTATGAGT ATCCTCATGG TGCGGCATGG AGCAATCTCG GTTAA
 
Protein sequence
MAITITRAEA RDVRFPTSRT LDGSDAMNPD PDYSAAYVIL HTNVPGLTGH GLTFTIGRGN 
ELCVAACQVL LPMVVNRSLE SITADMGAFW HMITGDSQLR WIGPEKGVIH LATAAVVNAV
WDLWAKVEQK PLWKLLSDMS PEELVRCIDF RYISDALTPD EARDILRRQE ATRAKREAEM
RTHGFPAYTT SAGWIGYSDD KVRRLCREAI DAGFQHIKMK VGRDLDADRR RARLIREIIG
PDRKLMADAN QVWDVPQAIA WMRDLAEFDL WWIEEPTSPD DILGHAAIAR AVAPVGVATG
EHVQNRIVFK QLLQMNAINF CQIDACRLGG VNEVLAVILM AAKFGVPVCP HAGGVGLCEY
VQHLSIWDYI CVSASLENRV IEYVDHLHEH FLDPVVIRNA RYMPPQTPGY SIEMKPESLA
MYEYPHGAAW SNLG
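
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any downstream use. The following is a rough sketch of how the blocks could be joined, how a GC percentage could be computed, and how the nucleotide sequence could be translated. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as starts). The names `clean_sequence`, `gc_percent`, and `translate` are hypothetical helpers, not part of any tool used to build this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = clean_sequence(
    "ATGGCAATCA CCATCACCCG CGCCGAAGCG CGCGACGTTC GCTTCCCCAC TTCCCGCACT"
)
print(translate(first_line))  # MAITITRAEARDVRFPTSRT
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Applying `gc_percent` to the full cleaned gene sequence would give a value to compare against the GC content field.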